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A method for obtaining structural information concerning an encoded 
molecule and method for selecting compounds 

Technical Field of the Invention 

5 In one aspect, the present invention relates to a method for obtaining struc- 
tural information about an encoded molecule. The encoded molecule may be 
produced by a reaction of a plurality of chemical entities and may be capable 
of being connected to an identifier oligonucleotide containing codons informa- 
tive of the identity of the chemical entities which have participated in the for- 

10 mation of the encoded molecule. In a certain embodiment, primers are de- 
signed complementary to the codons appearing on the identifier oligonucleo- 
tide, and the presence, absence or relative abundance of a codon is evalu- 
ated by mixing a primer with the identifier oligonucleotide in the presence of a 
polymerase and substrate (deoxy)ribonucleotide triphosphates and measur- 

15 ing the extension reaction. In another aspect, the invention provides a 

method for selecting compounds which binds to a target. More specifically, 
the invention relates to a method in which a target associated with 4 an oli- 
gonucleotide initially is mixed with a library of complexes, each complex 
comprising a display molecule and an oligonucleotide identifying said display 

20 molecule. Next, due an increased proximity, the target oligonucleotide is 

coupled to the identifier oligonucleotide of complexes having a display mole- 
cule with affinity towards the target. In a final stage the coupled nucleotides 
are analysed to deduce at least the identity of the display molecule. 

25 A method for obtaining structural information concerning an encoded 
molecule 

The below paragraphs up to the section entitled "Method for identifying a 
display, molecule" relate to the first aspect of the invention. 

30 The first aspect of the present invention relates to a method for obtaining 
structural information about an encoded molecule. The encoded molecule is 
usually produced by a process that comprises the reaction of a plurality of 



WO 2005/026387 PCT/DK2004/000630 

2 

chemical entities. The synthesis of the encoded molecule is recorded or 
programmed in an identifier oligonucleotide which is attached to the encoded 
molecule. The structural information obtained by the present method may be 
used to obtain the entire structure of the encoded molecule or a part thereof. 

5 . 

Background of the invention 

The generation of molecules carrying new properties remains a challenging 
task. Recently, a number of procedures have been suggested that should 
allow a more efficient generation and screening of a huge number of mole- 

10 cules. The approach taken may involve the encoding and/or templating of 
molecules other than natural biopolymers and a coupling of the molecules to 
respective templates or identifier parts containing information about the reac- 
tants that have participated in the formation of the molecule. These ap- 
proaches allow the researcher to generate and screen a huge number of 

15 molecules at the same time. 

In US 5,723,598 it is suggested to prepare libraries of Afunctional molecules, 
in which one part of the bifunctional molecule comprises an encoded part and 
the other part of the molecule contains an identifying part. The identifying part 

20 is segregated into codons, i.e. a stretches of nucleotides, which codes for 
reactants that have been involved in the synthesis of the encoded molecule. 
The libraries of bifunctional molecules are generally prepared by a split-and- 
mix method, which involves the initial reaction between a nascent bifunctional 
molecule and a range of different reactants in separate compartments at one 

25 end of the nascent bifunctional molecule and a corresponding range of identi- 
fier unit oligonucleotides (codons) and the other end. Subsequently, the con- 
tents of the compartments are mixed and the mixture is disposed in separate 
compartments and reacted again with another range of reactants and corre- 
sponding codons. Following the generation of a library of the bifunctional 

30 molecules, a partitioning with respect to affinity towards a target is conducted 
and the identifier part of the bifunctional molecule is decoded to establish the 
chemical structure of the compounds in the library that is likely to be a ligand 
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to the target The decoding step implies that the identifier oligonucleotides 
initially are amplified by PCR. The PCR product is subsequently incorporated 
in to a suitable vector which is transformed to a host organism, usually E. 
coli. Following the incubation of the E. coli, colonies are picked and se- 
5 quenced. 

Halpin and Harbury have in WO 00/23458 suggested an improvement to the 
approach stipulated above. The approach is based on the same split-and-mix 
strategy for synthesis of combinatorial libraries comprising two or more syn- 

10 . thetic steps. A plurality nucleic acid templates are used, each having at one 
end a chemical reactive site and dispersed throughout the stand a plurality of 
codon regions, each of said codon regions in turn specifying different codons. 
Separately, each of the strands, identified by a first codon region, is reacted 
at the chemical reaction sites with specific selected reagents. Subsequently, 

15 all the strands are pooled and subjected to a second partitioning based on a 
second codon region. The split-and-combine method is conducted an appro- 
priate number of times to produce a library of typically between 10 3 and 10 6 
different compounds. The decoding is performed utilizing the process de- 
picted above. 

20 

Recently, a new method for encoding molecules has been suggested, which 
can be performed in a single "pot". WO 02/00419 and WO 02/103008 dis- 
close methods for preparing virtually any molecule connected to a template 
coding for chemical entities which have reacted to form the molecule. In 

25 short, a template segregated into a plurality of codons and a plurality of build- 
ing blocks comprising a transferable chemical entity and an anticodon are 
initially provided. Under hybridisation conditions, the template and building 
blocks are annealed together and the chemical entities are subsequently re- 
acted to form the molecule. However, after a sufficient number of rounds of 

30 selections have been performed, the template must be decoded to establish 
the identity of the encoded molecule. The decoding step implies that the 
template oligonucleotides initially are amplified by PCR. The PCR product is 
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subsequently incorporated in to a suitable vector which is transformed to a 
host organism, usually E. coli. Following the incubation of the E. coli, colonies 
are picked and sequenced. 

5 In an aspect of the invention, it is the object to facilitate the decoding of the 
coding oligonucleotide in order to obtain at least partial structural information 
of the encoded molecule being a ligand towards a target. In another aspect of 
the invention, it is desired to obtain information about which chemical entities 
that result in encoded molecules successful in a selection process. Such 
10 chemical entities may be used in the formation of a second generation li- 
brary. 

Summary of the Invention 

The first aspect of the present invention concerns a method for obtaining 
15 structural information about an encoded molecule produced by a process 
comprising reaction of a plurality of chemical entities, said encoded molecule 
being capable of forming part of a complex also comprising an identifier oli- 
gonucleotide containing codons informative of the identity of chemical entities 
which have participated in the formation of the encoded molecule, the 
20 method comprises mixing a primer oligonucleotide with the identifier oligonu- 
cleotide, subjecting the mixture to a condition allowing for an extension reac- 
tion to occur when the primer is sufficient complementary to a part of the 
identifier oligonucleotide, and evaluating, based on measurement of the ex- 
tension reaction, the presence, absence, or relative abundance of one or 
25 more codons. 



The method according to the invention may be performed on a single identi- 
fier oligonucleotide or a composition of identifier oligonucleotides to obtain 
structural information about the encoded molecule or a composition of en- 
30 coded molecules, respectively, that have been attached to the identifier oli- 
gonucleotide^). 



WO 2005/026387 PCT/DK2004/000630 

5 

A single identifier may be analysed using the above method to verify the 
incorporation into the encoded molecule of one or more chemical entities or 
to deconvolute the identity of the entire encoded molecule. A composition of 
two or more identifier oligonucleotides generally results from a selection 
5 process, i.e. a process involving subjecting a library of different complexes to 
a condition partitioning the composition from the remainder of the library. 
Usually the partitioning condition includes an affinity assay in which the 
library of complexes is contacted with a target and the identifier 
oligonucleotides of the binding complexes are harvested. 

10 

The conditions allowing for an extension reaction to occur may be selected 
from a enzymatic or chemical means. Suitably, the condition involves one or 
more enzymes. In a certain embodiment of the invention, the condition which 
allows for an extension reaction to occur includes a polymerase or a ligase 
15 as well as suitable substrates for the enzyme used. Preferred is a poly- 
merase together with a blend of (deoxy)ribonucleotide triphosphates. Suita- 
bly, the blend include one or more of dATP, dGTP, dCTP, and dTTP. 

A library of complexes can have any appropriate size. Typically, the size is 
20 above 10 3 , typically above 10 6 different complexes. An effective, extensive, 
and rapid decoding is therefore desirable. The method of the present inven- 
tion may be used at various stages of the process of finding a ligand to a cer- 
tain target. As examples, the method of the invention may be used for con- 
trolling the quality of a starting library. The information acquired may be used 
25 to verify which codons being present, absent, and, in some embodiments, 
also the relative abundance. Thus, the method of the invention delivers a re- 
liable picture of the process which has produced the library. If, for some rea- 
son, a chemical entity has not been incorporated into the encoded molecules, 
the absence of a codon for this chemical entity will in certain embodiments of 
30 the invention indicate this fact. 
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Another example of the use of the present method is following the selection. 
After the selection has been performed the codon profile is indicative of the 
chemical entities that have been used in the synthesis of encoded molecules 
having an affinity towards the target. In the event the selection has been suf- 
5 ficiently effective it may be possible directly to deduce a part or the entire 
structure of binding encoded molecules. Alternatively, it may be possible to 
deduce a structural unit appearing more frequently among the encoded 
molecules after the selection, which gives important information to the struc- 
ture-activity-relationship (SAR). If the selection process has not narrowed the 

1 0 size of the library to a manageable number, the formation of a second gen- 
eration library may be contemplated. In the formation of the second genera- 
tion library chemical entities which have not been involved in the synthesis of 
encoded molecules that have been successful in the selection may be omit- 
ted, thus limiting the size of the new library and at the same time increasing 

15 the concentration of binding complexes. The second generation library may 
then be subjected to more stringent selection conditions to allow only the en- 
coded molecules with a higher affinity to bind to the target. The second gen- 
eration library may also be spiked with certain chemical entities suspected of 
increasing the performance of the final encoded molecule. The indication of 

20 certain successful chemical entities may be obtained from the SAR. The use 
in a second generation library of chemical entities, which have proved to be 
interesting for further investigation in a preceding library, may thus entail a 
shuffling with hew chemical entities that may focus the second generation 
library in a certain desired direction. 

25 

The relative abundance of codons may make it possible to decode a plurality 
of identifiers simultaneously, even in the case when two or more identifiers 
contain the same codon. Thus, following the formation and selection of a first, 
second or further generation library, the identity of binding encoded mole- 
30 cules may be partly or entirely deconvolved by the present method. 
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In a practical approach the library comprises complexes with identifier oli- 
gonucleotides having n codon positions each. In a certain aspect of the in- 
vention h is an integer independently selected from of 2 to 8. It may be pre- 
ferred that n is constant among all the complexes in the library to facilitate the 
5 decoding process. Each of the codons in a certain position is in an aspect of 
the invention selected from a set of m different codons. m may vary for each 
codon position or may be constant among the various codon positions. It may 
be preferred in some embodiments to have all the codons in each position 
selected from the same set of m codons. However, in other embodiments, 
10 especially such involving hybridisation in the recognition between the codon 
and the anticodon, it may be preferred that all the codons are different. 

Preferably, any member of the codon set differs from any other codons in the 
set with the identity of at least one nucleotide, i.e. at least one nucleotide po- 

15 sition occurs. In some aspects of the invention it is preferred that any mem- 
ber of the codon set differs with at least two nucleotides nucleotide positions 
from any other member of the set to increase the fidelity of the method. In 
general, it is desired to maximize the differences between individual codons 
of the set. In some embodiments of the invention, a set of primers comprising 

20 a sequence of complementing the set of codons are prepared. 

In a preferred aspect of the method a framing sequence is related to each of 
the n codon positions in a particular complex, said framing sequence posi- 
tions the reaction of a chemical entity in the synthesis history of the encoded 

25 molecule. Typically, the framing sequence is identical among the complexes 
for each of the reaction rounds and is selected from a group of n different 
nucleotide sequences. In a certain aspect of the invention n x m different 
primers fully or in part complementing any combination of the set of m differ- 
ent codons and the set of n different framing sequences is prepared. The n x 

30 m primers may be used in separate compartments to reveal the identity of a 
chemical entity as well as the point in time of the synthesis of the encoded 
molecule is has reacted. 
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In a particular aspect, the invention concerns a method for identifying the 
chemical entities utilized in the formation of an encoded molecule or a com- 
position of encoded molecules, wherein in separate compartments, n x m 
5 primers individually are mixed with an aliquot of a composition obtained by 
subjecting a library of different complexes to a condition partitioning said 
composition from the remainder of the library, subjected to a mixture of poly- 
merase and substrate (deoxy)ribonucleotide triphosphates under conditions 
allowing for an extension reaction to occur when a primer is sufficient com- 
10 plementary to a part of one or more identifier oligonucleotides present in the 
aliquot, and evaluation, based on measurement of the extension reaction, the 
presence, absence, or relative abundance of one or more codons in each 
compartment 

15 The invention also concerns a set comprising a collection of oligonucleotide 
primers, a polymerase, a composition of (deoxy)ribonucleotide triphosphates 
(dNTPs), and a library of complexes composed of a display molecule part 
and an identifier oligonucleotide, said oligonucleotide comprising codons in- 
formative of the identity of the chemical entities which has participated in the 

20 formation of the display molecule, wherein the oligonucleotide primers are 

sufficiently complementary to codons appearing on the identifier oligo nucleo- 
tides in the library to allow for an extension to occur. 

Detailed Description of the Invention 

25 Complex 

The complex comprises an encoded molecule and an identifier oligonucleo- 
tide. The identifier comprises codons that identify the encoded molecule. 
Preferably, the identifier oligonucleotide identifies the encoded molecule 
uniquely, i.e. in a library of complexes a particular identifier is capable of dis- 
30 tinguishing the molecule it is attached to from the rest of the molecules. 
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The encoded molecule and the identifier may be attached directly to each 
other or through a bridging moiety. In one aspect of the invention, the bridg- 
ing moiety is a selectively cleavable linkage. 

5 The identifier oligonucleotide may comprise two or more codons. In a pre- 
ferred aspect the identifier oligonucleotide comprises three or more codons. 
The sequence of each codon can be decoded utilizing the present method to 
identify reactants used in the formation of the encoded molecule. When the 
identifier comprises more than one codon, each member of a pool of chemi- 
10 cal entities can be identified and the order of codons is informative of the syn- 
thesis step each member has been incorporated in. 

In a certain embodiment, the same codon is used to code for several different 
chemical entities. In a subsequent identification step, the structure of the en- 

15 coded molecule can be deduced taking advantage of the knowledge of dif- 
ferent attachment chemistries, steric hindrance, deprotection of orthogonal 
protection groups, etc. In another embodiment, the same codon is used for a 
group of chemical entities having a common property, such as a lipophilic 
nature, a certain attachment chemistry etc. In a preferred embodiment, how- 

20 ever, the codon is unique i.e. a similar combination of nucleotides does not 
appear on the identifier oligonucleotide coding for another chemical entity. In 
a practical approach, for a specific chemical entity, only a single combination 
of nucleotides is used. In some aspects of the invention, it may be advanta- 
geous to use several codons for the same chemical entity, much in the same 

25 way as Nature uses up to six different codons for a single amino acid. The 
two or more codons identifying the same chemical entity may carry further 
information related to different reaction conditions. 

The sequence of the nucleotides in each codon may have any suitable 
30 length. The codon may be a single nucleotide or a plurality of nucleotides. In 
some aspects of the invention, it is preferred that each codon independently 
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comprises four or more nucleotides, more preferred 4 to 30 nucleotides. In 
some aspects of the invention the lengths of the codons vary. 

A certain codon may be distinguished from any other codon in the library by 
5 only a single nucleotide. However, to facilitate a subsequent decoding proc- 
ess and to increase the ability of the primer to discriminate between codons it 
is in general desired to have two or more mismatches between a particular 
codon and any other codon appearing on identifier oligonucleotide. As an 
example, if a codon length of 5 nucleotides is selected, more than 100 nu- 
10 cleotide combinations exist in which two or more mismatches appear. For a 
certain number of nucleotides in the codon, it is generally desired to optimize 
the number of mismatches between a particular codon relative to any other 
codon appearing in the library. 

15 The identifier oligonucleotide will in general have at least two codons ar- 
ranged in sequence, i.e. next to each other. Two neighbouring codons may 
be separated by a framing sequence. Depending on the encoded molecule 
formed, the identifier may comprise further codons, such as 3, 4, 5, or more 
codons. Each of the further codons may be separated by a suitable framing 

20 sequence. Preferably, all or at least a majority of the codons of the identifier 
are separated from a neighbouring codon by a framing sequence. The fram- 
ing sequence may have any suitable number of nucleotides, e.g. 1 to 20. Al- 
ternatively, codons on the identifier may be designed with overlapping se- 
quences. 

25 

The framing sequence, if present, may serve various purposes. In one setup 
of the invention, the framing sequence identifies the position of the codon. 
Usually, the framing sequence either upstream or downstream of a codon 
comprises information which positions the chemical entity and the reaction 
30 conditions in the synthesis history of the encodedmolecule. The framing se- 
quence may also or in addition provide for a region of high affinity. The high 
affinity region may ensure that a hybridisation event with an anti-codon will 
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occur in frame. Moreover, the framing sequence may adjust the annealing 
temperature to a desired level. 

A framing sequence with high affinity can be provided by incorporation of one 
5 or more nucleobases forming three hydrogen bonds to a cognate nucleo- 
base. Examples of nucleobases having this property are guanine and cyto- 
sine. Alternatively, or in addition, the framing sequence may be subjected to 
backbone modification. Several back bone modifications provides for higher 
affinity, such as 2-O-methyI substitution of the ribose moiety, peptide nucleic 
10 acids (PNA), and 2'4' O-methylene cyclisation of the ribose moiety, also re- 
ferred to as LNA (Locked Nucleic Acid). 

The sequence comprising a codon and an adjacent framing sequence has in 
a certain aspect of the invention a total length of 1 1 nucleotides or more, 

15 preferably 15 nucleotides or more. A primer may be designed to complemen- 
tary to the codon sequence as well as the framing sequence. The presence 
of an extension reaction under conditions allowing for such reaction to occur 
is indicative of the presence of the chemical entity encoded in the codon as 
well as the position said chemical entity has in the entire synthesis history of 

20 the encoded molecule. 

The identifier may comprise flanking regions around the coding section. The 
flanking regions can also serve as priming sites for amplification reactions, 
such as PCR or as binding region for oligonucleotide probe. The identifier 
25 may in certain embodiments comprise an affinity region having the property 
of being able to hybridise to a building block. 

It is to be understood that when the term identifier oligonucleotide is used in 
the present description and claims, the identifier oligonucleotide may be in 
30 the sense or the anti-sense format, i.e. the identifier can be a sequence of 
codons which actually codes for the encoded molecule or can be a sequence 
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complementary thereto. Moreover, the identifier may be single-stranded or 
double-stranded, as appropriate. 

The encoded molecule part of the complex is generally of a structure ex- 
5 pected of having an effect on a target. When the target is of pharmaceutical 
importance, the encoded molecule is generally a possible drug candidate. 
The complex may be formed by tagging a library of different possible drug 
candidates with a tag, e.g. a nucleic acid tag identifying each possible drug 
candidate. In another embodiment of the invention, the molecule formed by a 
10 variety of reactarrts which have reacted with each other and/or a scaffold 

molecule. Optionally, this reaction product may be post-modified to obtain the 
final molecule displayed on the complex. The post-modification may involve 
the cleavage of one or more chemical bonds attaching the encoded molecule 
to the identifier in order more efficiently to display the encoded molecule. 

The formation of an encoded molecule generally starts by a scaffold, i.e. a 
chemical unit having one or more reactive groups capable of forming a 
connection to another reactive group positioned on a chemical entity, thereby 
generating an addition to the original scaffold. A second chemical entity may 

20 react with a reactive group also appearing on the original scaffold or a 
reactive group incorporated by the first chemical entity. Further chemical 
. entities may be involved in the formation of the final reaction product. The 
formation of a connection between the chemical entity and the nascent 
encoded molecule may be mediated by a bridging molecule. As an example, 

25 if the nascent encoded molecule and the chemical entity both comprise an 
amine group a connection between these can be mediated by a dicarboxylic 
acid. A synthetic molecule is in general produced in vitro and may be a 
naturally occurring or an artificial substance. Usually, a synthetic molecule is 
not produced using the naturally translation system in an in vitro process. 

30 

The chemical entities that are precursors for structural additions or 
eliminations of the encoded molecule may be attached to a building block 
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prior to the participation in the formation of the reaction product leading the 
final encoded molecule. Besides the chemical entity, the building block 
generally comprises an anti-codon. In some embodiments the building blocks 
also comprise an affinity region providing for affinity towards the nascent 
5 complex. 

Thus, the chemical entities are suitably mediated to the nascent encoded 
molecule by a building block, which further comprises an anticodon. The anti- 
codon serves the function of transferring the genetic information of the 

10 building block in conjunction with the transfer of a chemical entity. The 

transfer of genetic information and chemical entity may occur in any order. 
The chemical entities are preferably reacted without enzymatic interaction in 
some aspects of the invention. Notably, the reaction of the chemical entities 
is preferably not mediated by ribosomes or enzymes having similar activity. 

15 In other aspects of the invention, enzymes are used to mediate the reaction 
between a chemical entity and a nascent encoded molecule. 

According to certain aspects of the invention the genetic information of the 
anti-codon is transferred by specific hybridisation to a codon on a nucleic 

20 acid template. Another method for transferring the genetic information of the 
anti-codon to the nascent complex is to anneal an oligonucleotide, 
complementary to the anti-codon and attach this oligonucleotide to the 
complex, e.g. by ligation. A still further method involves transferring the 
genetic information of the anti-codon to the nascent complex by an extension 

25 reaction using a polymerase and a mixture of dNTPs. 

The chemical entity of the building block may in most cases be regarded as a 
precursor for the structural entity eventually incorporated into the encoded 
molecule. In other cases the chemical entity provides for the eliminations of 
30 chemical units of the nascent encoded molecule. Therefore, when it in the 
present application with claims is stated that a chemical entity is transferred 
to a nascent encoded molecule it is to be understood that not necessarily all 
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the atoms of the original chemical entity is to be found in the eventually 
formed encoded molecule. Also, as a consequence of the reactions involved 
in the connection, the structure of the chemical entity can be changed when it 
appears on the nascent encoded molecule. Especially, the cleavage resulting 
5 in the release of the entity may generate a reactive group which in a subse- 
quent step can participate in the formation of a connection between a nas- 
cent complex and a chemical entity. 



The chemical entity of the building block comprises at least one reactive 
10 group capable of participating in a reaction which results in a connection be- 
tween the chemical entity of the building block and another chemical entity or 
a scaffold associated with the nascent complex. The number of reactive 
groups which appear on the chemical entity is suitably one to ten. A building 
block featuring only one reactive group is used i.a. in the end positions of 
15 polymers or scaffolds, whereas building blocks having two reactive groups 
are suitable for the formation of the body part of a polymer or scaffolds capa- 
ble of being reacted further. One, two or more reactive groups intended for 
the formation of connections, are typically present on scaffolds. Non-limiting 
examples of scaffolds are opiates, steroids, benzodiazepines, hydantoines, 
20 and peptidylphosphonates. 

The reactive group of the chemical entity may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
25 group of the nascent complex through a bridging fill-in group. It is to be un- 
derstood that not all the atoms of a reactive group are necessarily maintained 
in the connection formed. Rather, the reactive groups are to be regarded as 
precursors for the structure of the connection. 

30 The subsequent cleavage step to release the chemical entity from the build- 
ing block can be performed in any appropriate way. In an aspect of the inven- 
tion the cleavage involves usage of a chemical reagent or an enzyme. The 
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cleavage results in a transfer of the chemical entity to the nascent encoded 
molecule or in a transfer of the nascent encoded molecule to the chemical 
entity of the building block. In some cases it may be advantageous to intro- 
duce new chemical groups as a consequence of linker cleavage. The new 
5 chemical groups may be used for further reaction in a subsequent cycle, ei- 
ther directly or after having been activated. In other cases it is desirable that 
no trace of the linker remains after the cleavage. 

In another aspect, the connection and the cleavage is conducted as a simul- 
10 taneous reaction, i.e. either the chemical entity of the building block or the 
nascent encoded molecule is a leaving group of the reaction. In some as- 
pects of the invention, it is appropriate to design the system such that the 
connection and the cleavage occur simultaneously because this will reduce 
the number of steps and the complexity. The simultaneous connection and 
15 cleavage can also be designed such that either no trace of the linker remains 
or such that a new chemical group for further reaction is introduced, as de- 
scribed above. 

The attachment of the chemical entity to the building block, optionally via a 
20 suitable spacer can be at any entity available for attachment, e.g. the chemi- 
cal entity can be attached to a nucleobase or the backbone. In general, it is 
preferred to attach the chemical entity at the phosphor of the intemucleoside 
linkage or at the nucleobase. When the nucleobase is used for attachment of 
the chemical entity, the attachment point is usually at the 7 position of the 
25 purines or 7-deaza-purins or at the 5 position of pyrimidines. The nucleotide 
may be distanced from the reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the conformational spaced 
sampled by the reactive group is optimized for a reaction with the reactive 
group of the nascent encoded molecule. 

30 

The encoded molecules may have any chemical structure. In a preferred 
aspect, the encoded molecule can be any compound that may be 
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synthesized in a component-by-component fashion. In some aspects the 
synthetic molecule is a linear or branched polymer. In another aspect the 
synthetic molecule is a scaffolded molecule; The term "encoded molecule" 
also comprises naturally occurring molecules like a-polypeptides etc, 
5 however produced in vitro usually in the absence of enzymes, like ribosomes. 
In certain aspects, the synthetic molecule of the library is a non-a- 
polypeptide. 

The encoded molecule may have any molecular weight. However, in order to 
10 be orally available, it is in this case preferred that the synthetic molecule has 
a molecular weight less than 2000 Daltons, preferably less than 1000 Dalton, 
and more preferred less than 500 Daltons. 

The size of the library may vary considerably pending on the expected result 
15 of the inventive method. In some aspects, it may be sufficient that the library 
comprises two, three, or four different complexes. However, in most events, 
more than two different complexes are desired to obtain a higher diversity. In 
some aspects, the library comprises 1 ,000 or more different complexes, more 
preferred 1,000,000 or more different complexes. The upper limit for the size 
20 of the library is only restricted by the size of the vessel in which the library is 
comprised. It may be calculated that a vial may comprise up to 10 14 different 
complexes. 

Extension reaction 

25 The extension reaction requires a primer, a polymerase as well as a collec- 
tion of deoxyribonucleotide triphosphates (abbreviated dNTP's herein) to 
proceed. An extension product may be obtained in the event the primer is 
sufficient complementary to an identifier oligonucleotide for a polymerase to 
recognise the double helix as a substrate. After binding of the polymerase to 

30 the double helix, the deoxyribonucleotide triphosphates (blend of dATP, 

dCTP, dGTP, and dTTP) are incorporated into the extension product using the 
identifier oligonucleotide as template. The conditions allowing for the exten- 



WO 2005/026387 PCT7DK2004/000630 

17 

sion reaction to occur usually includes a suitable buffer. The buffer may be 
any aqueous or organic solvent or mixture of solvents in which the poly- 
merase has a sufficient activity. To facilitate the extension process the poly- 
merase and the mixture of dNTP's are generally included in a buffer which is 
5 added to the identifier oligonucleotide and primer mixture. An exemplary kit 
comprising the polymerase and the nNTP's for performing the extension 
process comprises the following: 50 mM KCI; 10 mM Tris-HCI at pH 8.3; 1.5 
mM MgCI2 ; 0.001% (wt/vol) gelatin, 200 pM dATP; 200 pM dTTP; 200 pM 
dCTP; 200 pM dGTP; and 2.5 units Thermus aquaticus (Taq) DNA poly- 
10 merase I (U.S. Pat. No. 4,889,818) per 100 microliters (pi) of buffer. 

The primer may be selected to be complementary to one or more codons or 
parts of such codons. The length of the primers may be determined by the 
length of the codons, however, the primers usually are at least about 1 1 nu- 

15 cleotides in length, more preferred at least 15 nucleotides in length to allow 
for an efficient extension by the polymerase. The presence or absence of one 
or more codons is indicated by the presence of or absence of an extension 
product. The extension product may be measured by any suitable method, 
such as size fractioning on an agarose gel and staining with ethidium bro- 

20 mide. 

In a preferred embodiment the admixture of identifier oligonucleotide and 
primer is thermocycled to obtain a sufficient number of copies of the exten- 
sion product. The thermocycling is typically earned out by repeatedly increas- 

25 ing and decreasing the temperature of the mixture within a temperature 
range whose lower limit is about 30 degrees Celsius (30°C) to about 55°C 
and whose upper limit is about 90°C to about 100° C. The increasing and 
decreasing can be continuous, but is preferably phasic with time periods of 
relative temperature stability at each of temperatures favouring polynucleo- 

30 tide synthesis, denaturation and hybridization. 
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When a single complex is analysed in accordance with the present method, 
the result may be used to verify the presence or absence of a specific chemi- 
cal entity during the formation of the display molecule. The formation of an 
extension product is indicative of the presence of an oligonucleotide part 
5 complementary to the primer in the identifier oligonucleotide. Conversely, the 
absence of an extension product is indicative of the absence of an oligonu- 
cleotide part complementary to the primer in the identifier oligonucleotide. 
Selecting the sequence of the primer such that it is complementary to one or 
more codons will therefore provide information of the structure of the en- 
10 coded molecule coded for by this codon(s). 

In a preferred aspect of the invention, in the mixture of the identifier oligonu- 
cleotide and the primer oligonucleotide, a second primer complementary to a 
sequence of the extension product is included. The second primer is also 

15 termed reverse primer and ensures an exponential increase of the number of 
produced extension products. The method using a forward and reverse 
primer is well known to skilled person in the art and is generally referred to as 
polymerase chain reaction (abbreviated PCR) in the present application with 
claims. In one embodiment of the invention the reverse primer is annealed to 

20 a part of the extension product downstream, i.e. near the 3'end of the exten- 
sion product, or a part complementing the coding part of the identifier oli- 
gonucleotide. In another embodiment, the first primer (forward primer) an- 
neals to an upstream position of the identifier oligonucleotide, preferably be- 
fore the coding part, and the reverse primer anneals to a sequence of the 

25 extension product complementing one or more codons or parts thereof. 

The amplicons resulting from the PCR process may be stained during or fol- 
lowing the reaction to ease the detection. A staining after the PCR process 
may be prepared with e.g. ethidium bromide or a similar staining agent. As 
30 an example, amplicons from the PCR process is run on an agarose gel and 
subsequently stained with ethidium bromide. Under UV illumination bands of 
amplicons becomes visible. It is possible to incorporate the staining agent in 



WO 2005/026387 PCT/DK2004/000630 

19 

the agarose gel or to allow a solution of the staining agent to migrate through 
the gel. The amplicons may also be stained during the PCR process by an 
intercalating agent, like CYBR. In presence of the intercalating agent while 
the amplification proceeds it will incorporate in the double helix. The interca- 
5 lation agent may then be made visible by irradiation by a suitable source. 

The intensity of the staining is informative of the relative abundance of a spe- 
cific amplicon. Thus, it is possible to quantify the occurrence of a codon in an 
identifier oligonucleotide. When a library of bifunctional complexes has been 

10 subjected to a selection the codons in the pool of identifier oligonucleotides 
which has been selected can be quantified using this method. As an example 
a sample of the selected identifier oligonucleotides is subjected to various 
PCR amplifications with different primers in separate compartments and the 
PCR product of each compartment is analysed by electrophoresis in the 

15 presence of ethidium bromide. The bands that appear can be quantified by a 
densitometric analysis after irradiation by ultraviolet light and the relative 
abundance of the codons can be measured. 

Alternatively, the primers may be labelled with a suitable small molecule, like 
20 biotin or digoxigenin. A PCR-ELISA analysis may subsequently be performed 
based on the amplicons comprising the small molecule. A preferred method 
includes the application of a solid support covered with streptavidin or avidin 
when biotin is used as label and anti-digoxigenin when digoxigenin is used as 
the label. Once captured, the amplicons can be detected using an enzyme- 
25 labelled avidin or anti-dixigenin reporter molecule similar to a standard ELISA 
format. 

To avoid laborious post-PCR handling steps required to evaluate the ampli- 
cons, it is in a certain embodiment preferred to measure the extension proc- 
30 ess "real time". Several real time PCR processes has been developed and all 
the suitable real time PCR process available to the skilled person in the art 
can be used in the evaluating step of the present invention and are include in 
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the present scope of protection. The PCR reactions discussed below are of 
particular interest. 

The monitoring of accumulating amplicons in real time has been made possi- 
5 ble by labelling of primers, probes, or amplicons with fluorogenic molecules. 
The real time PCR amplification is usually performed with a speed faster than 
the conventional PCR, mainly due to reduced cycles time and the use of 
sensitive methods for detection of emissions from the fluorogenic labels. The 
most commonly used fluorogenic oligoprobes rely upon fluorescent reso- 
.10 nance energy transfer (FRET) between fluorogenic labels or between one 

flourophor and a dark or "black-hole" non-fluorescent quencher (NFQ), which 
disperse energy as heat rather than fluorescence. FRET is a spectroscopic 
process by which energy is passed between molecules separated by 10-100 
A that have overlapping emission and absorption spectra. An advantage of 
15 many real time PCR methods is that they can be carried out in a closed sys- 
tem, i.e. a system which does not need to be opened to examine the result of 
the PCR. A closed system implies a reduced result turnaround, minimisation 
of the potential for carry-over contamination and the ability to closely scruti- 
nise the essay's performance. 

20 

The present real time PCR methods currently available to the skilled person 
can be classified into either amplicon sequence specific or non-specific 
methods. The basis for the non-specific detection methods is a DNA-binding 
fluorogenic molecule. Included in this class are the earliest and simplest ap- 

25 proaches to real time PCR. Ethidium bromide, YO-PRO-1, and SYBR® green 
1 all fluorescence when associated with double stranded DNA which is ex- 
posed to a suitable wavelength of light. This approach requires the fluores- 
cent agent to be present during the PCR process and provides for a real time 
detection of the fluorescent agent as it is incorporated into the double 

30 stranded helix. 
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The amplicons sequence specific methods includes, but are not limited to, 
the TaqMan®, hairpin, LightCycler®, Sunrise®, and Scorpion® methods. The 
LightCycler® method also designated "HybProbes" make use of a pair of ad- 
jacent, fluorogenic hybridisation oligonucleotide probes. A first, usually the 
5 upstream oligoprobe is labelled with a 3' donor fluorophore and the second, 
usually the downstream probe is commonly labelled with either a Light cycler 
Red 640 or Red 705 acceptor fluorophore a the 5' terminus so that when 
both oRgoprobes are hybridised the two fluorophores are located in close 
proximity, such as within 10 nm, of each other. The close proximity provides 

10 for the emission of a fluorescence when irradiated with a suitable light sour- 
ce, such a blue diode in case of the LightCycler®. The region for annealing of 
the probes may be any suitable position that does not interfere with the 
primer annealing. In a suitable setup, the site for binding the probes are posi- 
tioned downstream of the codon region on the identifier oligonucleotide. Al- 

15 tematively, when a reverse primer is used, the region for annealing the 
probes may be at the 3' end of the strand complementing the identifier oli- 
gonucleotide. Another embodiment of the LightCycler method includes that 
the pair of oligonucleotide probes are annealed to one or more codons and 
primer sites exterior to the coding part of the identifier oligonucleotide are 

20 used for PCR amplification. 

The TaqMan® method, also referred to as the 5' nuclease or hydrolysis 
method, requires an oligoprobe, which is attached to a reporter flourophor, 
such as 6-carboxy-fluoroscein, and a quencher fluorophore, such as 6- 

25 carboxy-tetramethyl-rhodamine, at each end. When in close proximity, i.e. 
annealed to an identifier oligonucleotide, or a sequence complementing the 
identifier oligonucleotide, the quencher will "hijack" the emissions that have 
resulted from the excitation of the reporter. As the polymerase progresses 
along the relevant strand, it displaces and the hydrolyses the oligoprobe via 

30 its 5^3' endonuclease activity. Once the reporter is removed from the extin- 
guishing influence of the quencher, it is able to release excitation energy at a 
wavelength that can be monitored by a suitable instrument, such as ABI 



WO 2005/026387 PCT/DK2004/000630 

22 

Prism® 7700. The fractional cycle number at which the real-time fluorescence 
signal mirrors progression of the reaction above the background noise is 
normally used as an indicator of successful identifier oligonucleotide amplifi- 
cation. This threshold cycle (C T ) is defined as the PCR cycle in which the 
5 gain in fluorescence generated by the accumulating amplicons exceeds 10 
standard deviations of the mean base line fluorescence. The C T is propor- 
tional to the number of identifier oligonucleotide copies present in the sample. 
The TaqMan probe is usually designed to hybridise at a position downstream 
of a primer binding site, be it a forward or a reverse primer. When the primer 
10 is designed to anneal to one or more codons of the identifier oligonucleotide, 
the presence of these one or more codons is indicated by the emittance of 
light. Furthermore, the quantity of the identifier oligonucleotides comprising 
the one or more codons may be measured by the C T value. 

15 The Hairpin method involves an oligoprobe, in which a fluorophore and a 

quencher are positioned at the termini. The labels are hold in close proximity 
by distal stem regions of homologous base pairing deliberately designed to 
create a hairpin structure which result in quenching either by FRET or a di- 
rect energy transfer by a collisional mechanism due to the intimate proximity 

20 of the labels. When direct energy transfer by a collision mechanism is used 
the quencher is usually different from the FRET mechanism, and is suitably 
4-(4'-dimethylamino-phenylazo)-benzene (DABCYL). In the presence of a 
complementary sequence, usually downstream of a primer, or within the 
bounds of the primer binding sides in case of more than one a single primer, 

25 the oligoprobe will hybridise, shifting into an open configuration. The fluoro- 
phore is now spatially removed from the quencher's influence and fluores- 
cence emissions are monitored during each cycle. In a certain aspect, the 
hairpin probe may be designed to anneal to a codon in order to detect this 
codon if present on the identifier oligonucleotide. This embodiment may be . 

30 suitable if codons only differs from each other with a single or a few nucleo- 
tides, because is in well-known that the occurrence of a mismatch between a 
hairpin oligoprobe and its target sequence has a greater destabilising effect 
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on the duplex than the introduction of an equivalent mismatch between the 
target oligonucleotide and a linear oligoprobe. This is probably because the 
hairpin structure provides a highly stable alternate conformation. 

5 The Sunrise and Scorpion methods are similar in concept to the hairpin oli- 
goprobe, except that the label becomes irreversible incorporated in to the 
PCR product. The Sunrise method involves a primer (commercially available 
as Amplifluor™ hairpin primers) comprising a 5' fluorophore and a quencher, 
e.g. DABCYL The labels are separated by complementary stretches of se- 

10 quence that create a stem when the sunrise primer is closed. At the 3' termi- 
nus is a target specific primer sequence. In a preferred embodiment the tar- 
get sequence is a codon, optionally more codons. The sunrise primer's se- 
quence is intended to be duplicated by the nascent complementary stand 
and, in this way, the stem is destabilised, the two fluorophores are held apart, . 

15 usually between 15 and 25 nucleotides, and the fluorophore is free to emit its 
excitation energy for monitoring. The Scorpion primer resembles the sunrise 
primer, but derivate in having a moiety that blocks duplication on the signal- 
ling portion of the scorpion primer. The blocking moiety is typically hexethyl- 
ene glycol. In addition to the difference in structure, the function of the scor- 

20 pion primers differs slightly in that the 5' region of the oligonucleotide is de- 
signed to hybridise to a complementary region within the amplicons. In a cer- 
tain embodiment the complementary region is a codon on the identifier oli- 
gonucleotide. The hybridisation forces the labels apart disrupting the hairpin 
and permitting emission in the same way as the hairpin probes. 

25 

Methods for forming a library of complexes 

The complexes comprising an identifier part having two or more codons that 
code for reactants that have reacted in the formation of the encoded mole- 
cule part of the complex may be formed by a variety of processes. Generally, 
30 the preferred methods can be used for the formation of virtually any kind of 
encode molecule. Suitable examples of processes include prior art methods 
disclosed in WO 93/20242, WO 93/06121, WO 00/23458, WO 02/074929, 



WO 2005/026387 PCT/DK2004/000630 

24 

and WO 02/103008, the content of which being incorporated herein by refer- 
ence as well as methods of the present applicant not yet public available, 
including the methods disclosed in DK PA 2002 01955 filed 19 December 
2002, and DK PA 2003 00430 filed 20 March 2003. Any of these methods 
5 may be used, and the entire content of the patent applications are included 
herein by reference. 

Below four preferred embodiments are described. A first embodiment 
disclosed in more detail in WO 02/103008 is based on the use of a 

10 polymerase to incorporate unnatural nucleotides as building blocks. Initially, a 
plurality of template oligonucleotides is provided. Subsequently primers are 
annealed to each of the templates and a polymerase is extending the primer 
using nucleotide derivatives which have appended chemical entities. 
Subsequent to or simultaneously with the incorporation of the nucleotide 

15 derivatives, the chemical entities are reacted to form a reaction product. The 
encoded molecule may be post-modified by cleaving some of the linking 
moieties to better present the encoded molecule. 

- Several possible reaction approaches for the chemical entities are apparent. 

20 First, the nucleotide derivatives can be incorporated and the chemical entities 
subsequently polymerised. In the event the chemical entities each carry two 
reactive groups, the chemical entities can be attached to adjacent chemical 
entities by a reaction of these reactive groups. Exemplary of the reactive 
groups are amine and carboxylic acid, which upon reaction form an amide 

25 bond. Adjacent chemical entities can also be linked together using a linking 
or bridging moiety. Exemplary of this approach is the linking of two chemical 
entities each bearing an amine group by a bi-carboxylic acid. Yet another 
approach is the use of a reactive group between a chemical entity and the 
nucleotide building block, such as an ester or a thioester group. An adjacent 

30 building block having a reactive group such as an amine may cleave the 

interspaced reactive group to obtain a linkage to the chemical entity, e.g. by 
an amide linking group. 
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A second embodiment for obtainment of complexes pertains to the use of 
hybridisation of building blocks to an identifier oligonucleotide and reaction of 
chemical entities attached to the building blocks in order to obtain a reaction 
5 product This approach comprises that templates are contacted with a 
plurality of building blocks, wherein each building block comprises an anti- 
codon and a chemical entity. The anti-codons are designed such that they 
recognise a sequence, i.e. a codon, on the template. Subsequent to the 
annealing of the anti-codon and the codon to each other a reaction of the 
10 chemical entity is effected. 

The template may be associated with a scaffold. Building blocks bringing 
chemical entities in may be added sequentially or simultaneously and a 
reaction of the reactive group of the chemical entity may be effected at any 
15 time after the annealing of the building blocks to the template. 

A third embodiment for the generation of a complex includes chemical or 
enzymatical ligation of building blocks when these are lined up on a template. 
Initially, templates are provided, each having one or more codons. The 
20 templates are contacted with building blocks comprising anti-codons linked to 
chemical entities. The two or more anti-codons annealed on a template are 
subsequently ligated to each other and a reaction of the chemical entities is 
effected to obtain a reaction product. The method is disclosed in more detail 
in DK PA 2003 00430 filed 20 March 2003. 

25 

A fourth embodiment makes use of the extension by a polymerase of an 
affinity sequence of the nascent complex to transfer the anti-codon of a 
building block to the nascent complex. The method implies that a nascent 
complex comprising a scaffold and an affinity region is annealed to a building 
30 block comprising a region complementary to the affinity section. 

Subsequently the anti-codon region of the building block is transferred to the 
nascent complex by a polymerase. The transfer of the chemical entity may 



WO 2005/026387 PCT/DK2004/000630 

26 

be transferred prior to, simultaneously with or subsequent to the transfer of 
the anti-codon. This method is disclosed in detail in DK PA 2002 01955 filed 
19 December 2002. 

5 After or simultaneously with the formation of the reaction product some of the 
linkers to the template may be cleaved, however at least one linker must be 
maintained to provide for the complex. 

Nucleotides 

10 The nucleotides used in the present invention may be linked together in a 
sequence of nucleotides, i.e. an oligonucleotide. Each nucleotide monomer is 
normally composed of two parts, namely a nucleobase moiety, and a back- 
bone. The back bone may in some cases be subdivided into a sugar moiety 
and an internucleoside linker. 

15 

The nucleobase moiety may be selected among naturally occurring nucleo- 
bases as well as non-naturally occurring nucleobases. Thus, "nucleobase" 
includes not only the known purine and pyrimidine hetero-cycles, but also 
heterocyclic analogues and tautomers thereof. Illustrative examples of nu- 

20 cleobases are adenine, guanine, thymine, cytosine, uracil, purine, xanthine, 
diaminopurine, S-oxo-N^methyladenine, 7-deazaxanthine, 7-deazaguanine, 
N 4 ,N 4 -ethanocytosin, N 6 ,N 6 -ethano-2,6-diamino-purine, 5-methylcytosine, 5- 
(C 3 -C 6 )-alkynylcytosine, 5-fluorouracil, 5-bromouracil, pseudoisocytosine, 2- 
hydroxy-5-methyl-4-triazolopyridine, isocytosine, isoguanine, inosine and the 

25 "non-naturally occurring" nucleobases described in Benner et al., U.S. Pat 
No. 5,432,272. The term "nucleobase" is intended to cover these examples 
as well as analogues and tautomers thereof. Especially interesting nucleo- 
bases are adenine, guanine, thymine, cytosine, 5-methylcytosine, and uracil, 
which are considered as the naturally occurring nucleobases in relation to 

30 therapeutic and diagnostic application in humans. 

Examples of suitable specific pairs of nucleobases are shown below: 
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Natural Base Pairs 



R=H: Uracil 

R R=CH3: Thymine Cytosine 

•TV 



o 

N 0 „ 



NH 2 HN— ^ Backbone 

Backbone ^j-X J< ' ° 

A . . Backbone 
Adenine 

Guanine 



Nv4 N O o ^V^l 




Synthetic purine bases pairring with natural pyrimidines 
R=H: Uracil 

r R=CH 3 : Thymine Cytosine 

NH2 HN^ BaCkb ° ne 
Backbone O 



7-deaza adenine 

7-deaza guanine 



Suitable examples of backbone units are shown below (B denotes a nucleo- 
5 base): 
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The sugar moiety of the backbone is suitably a pentose but may be the ap- 
propriate part of an PNA or a six-member ring. Suitable examples of possible 
5 pentoses include ribose, 2-deoxyribose, 2*-0-methyl-ribose, 2'-flour-ribose, 
and 2'-4'-0-methylene-ribose (LNA). Suitably the nucleobase is attached to 
the 1 ' position of the pentose entity. 



An intemucleoside linker connects the 3' end of preceding monomer to a 5' 
1 0 end of a succeeding monomer when the sugar moiety of the backbone is a 
pentose, like ribose or 2-deoxyribose. The intemucleoside linkage may be 
the natural occurring phospodiester linkage or a derivative thereof. Exam- 
ples of such derivatives include phosphorothioate, methylphosphonate, 
phosphoramidate, phosphotriester, and phosphodithioate. Furthermore, the 
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intemucleoside linker can be any of a number of non-phosphorous-containing 
linkers known in the art. 

Preferred nucleic acid monomers include naturally occurring nucleosides 
5 forming part of the DNA as well as the RNA family connected through phos- 
phodiester linkages. The members of the DNA family include de- 
oxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine. The 
members of the RNA family include adenosine, guanosine, uridine, cytidine, 
and inosine. Inosine is a non-specific pairing nucleoside and may be used as 
10 universal base because inosine can pair nearly isoenergetically with A, T, 
and C. Other compounds having the same ability of non-specifically base- 
pairing with natural nucleobases have been formed. Suitable compounds 
which may be utilized in the present invention includes among others the 
compounds depicted below 
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Examples of Universal Bases: 




5 



Building block 

The chemical entities that are precursors for structural additions or elimina- 
tions of the encoded molecule may be attached to a building block prior to 
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the participation in the formation of the reaction product leading the final en- 
coded molecule. Besides the chemical entity, the building block generally 
comprises an anti-codon. 

5 The chemical entity of the building block comprises at least one reactive 
group capable of participating in a reaction which results in a connection be- 
tween the chemical entity of the building block and another chemical entity or 
a scaffold associated with the nascent complex. The connection is facilitated 
by one or more reactive groups of the chemical entity. The number of reac- 

10 tive groups which appear on the chemical entity is suitably one to ten. A 
building block featuring only one reactive group is used i.a. in the end posi- 
tions of polymers or scaffolds, whereas building blocks having two reactive 
groups are suitable for the formation of the body part of a polymer or scaf- 
folds capable of being reacted further. One, two or more reactive groups in- 

15 tended for the formation of connections, are typically present on scaffolds. 

The reactive group of the building block may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
20 group of the nascent complex through a bridging fill-in group. It is to be un- 
derstood that not all the atoms of a reactive group are necessarily maintained 
in the connection formed. Rather, the reactive groups are to be regarded as 
precursors for the structure of the connection. 

25 The subsequent cleavage step to release the chemical entity from the build- 
ing block can be performed in any appropriate way. In an aspect of the inven- 
tion the cleavage involves usage of a reagent or an enzyme. The cleavage 
results in a transfer of the chemical entity to the nascent encoded molecule 
or in a transfer of the nascent encoded molecule to the chemical entity of the 

30 building block. In some cases it may be advantageous to introduce new 
chemical groups as a consequence of linker cleavage. The new chemical 
groups may be used for further reaction in a subsequent cycle, either directly 
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or after having been activated. In other cases it is desirable that no trace of 
the linker remains after the cleavage. 

In another aspect, the connection and the cleavage is conducted as a simul- 
5 taneous reaction, i.e. either the chemical entity of the building block or the 
nascent encoded molecule is a leaving group of the reaction. In general, it is 
preferred to design the system such that the connection and the cleavage 
occur simultaneously because this will reduce the number of steps and the 
complexity. The simultaneous connection and cleavage can also be designed 
10 such that either no trace of the linker remains or such that a new chemical 
group for further reaction is introduced, as described above. 

The attachment of the chemical entity to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the chemi- 

15 cal entity can be attached to a nucleobase or the backbone. In general, it is 
preferred to attach the chemical entity at the phosphor of the internucleoside 
linkage or at the nucleobase. When the nucleobase is used for attachment of 
the chemical entity, the attachment point is usually at the 7 position of the 
purines or 7-deaza-purins or at the 5 position of pyrimidines. The nucleotide 

20 may be distanced from the reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the conformational space 
sampled by the reactive group is optimized for a reaction with the reactive 
group of the nascent encoded molecule or reactive site. 

25 The anticodon complements the codon of the identifier sequence and gener- 
ally comprises the same number of nucleotides as the codon. The anticodon 
may be adjoined with a fixed sequence, such as a sequence complementing 
a framing sequence. 

30 Various specific building blocks are envisaged. Building blocks of particular 
interest are shown below. 



WO 2005/026387 



PCT/DK2004/000630 



33 



Specific Building blocks 

Building blocks transferring a chemical entity to a recipient nucleophilic group 
The building block indicated below is capable of transferring a chemical entity 
(CE) to a recipient nucleophilic group, typically an amine group. The bold 
5 lower horizontal line illustrates the building block and the vertical line illus- 
trates a spacer. The 5-membered substituted N-hydroxysuccinimid (NHS) 
ring serves as an activator, i.e. a labile bond is formed between the oxygen 
atom connected to the NHS ring and the chemical entity. The labile bond 
may be cleaved by a nucleophilic group, e.g. positioned on a scaffold 



The 5-membered substituted N-hydroxysuccinimid (NHS) ring serves as an 
activator, i.e. a labile bond is formed between the oxygen atom connected to 

15 the NHS ring and the chemical entity. The labile bond may be cleaved by a 
nucleophilic group, e.g. positioned on a scaffold, to transfer the chemical en- 
tity to the scaffold, thus converting the remainder of the fragment into a leav- 
ing group of the reaction. When the chemical entity is connected to the acti- 
vator through an carbonyl group and the recipient group is an amine, the 

20 bond formed on the scaffold will an amide bond. The above building block is 
the subject of the Danish patent application No. PA 2002 01946 and the US 
provisional patent application No. 60/434,439, the content of which are incor- 
porated herein in their entirety by reference. 

25 Another building block which may form an amide bond is 



10 
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R may be absent or N0 2 , CF 3 , halogen, preferably CI, Br, or I, and Z may be 
S or O. This type of building block is disclosed in Danish patent application 
No. PA 2002 0951 and US provisional patent application filed 20 December 
5 2002 with the title "A building block capable of transferring a functional entity 
to a recipient reactive group". The content of both patent application are in- 
corporated herein in their entirety by reference. 

A nucleophilic group can cleave the linkage between Z and the carbonyl 
10 group thereby transferring the chemical entity -(C=0)-CE' to said nucleo- 
philic group. 

Building blocks transferring a chemical entity to a recipient reactive group 
forming a C=C bond 

15 A building block as shown below are able to transfer the chemical entity to a 
recipient aldehylde group thereby forming a double bond between the carbon 
of the aldehyde and the chemical entity 
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CE 




O 



The above building block is comprised by the Danish patent application No. 
DK PA 2002 01 952 and the US provisional patent application filed 20 De- 
5 cember 2002 with the title "A building block capable of transferring a func- 
tional entity to a recipient reactive group forming a C=C double bond". The 
content of both patent applications are incorporated herein in their entirety by 



1 0 Building blocks transferring a chemical entity to a recipient reactive group 
forming a C-C bond 

The below building block is able to transfer the chemical entity to a recipient 
group thereby forming a single bond between the receiving moiety, e.g. a 
scaffold, and the chemical entity. 



The above building block is comprised by the Danish patent application No. 
DK PA 2002 01947 and the US provisional patent application No 60/434,428. 
The content of both patent applications are incorporated herein in their en- 
20 tirety by reference. 



reference. 
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Another building block capable of transferring a chemical entity to a receiving 
reactive group forming a single bond is . 

O .. 




5 I 

The receiving group may be a nucleophile, such as a group comprising a 
hetero atom, thereby forming a single bond between the chemical entity and 
the hetero atom, or the receiving group may be an electronegative carbon 
10 atom, thereby forming a Q-C bond between the chemical entity and the scaf- 
fold. 

The chemical entity attached to any of the above building blocks may be a 
selected from a large arsenal of chemical structures. Examples of chemical 
15 entities are 

H or entities selected among the group consisting of a d-C 6 alkyl, C 2 -C 6 al- 
kenyl, C2-C6 alkynyl, C 4 -C 8 alkadienyl, C3-C7 cycloalkyl, C3-C7 cycloheteroal- 
kyl, aryl, and heteroaryl, said group being substituted with 0-3 R 4 , 0-3 R 5 and 
0-3 R 9 or d-C 3 alkylene-NR 4 2 , C1-C3 alkylene-NR 4 C(0)R 8 , d-C 3 al- 

20 kylene-NR 4 C(0)OR 8 , C1-C2 alkylehe-0-NR 4 2 , d-Q> alkylene-0-NR 4 C(0)R 8 , 
C1-C2 alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 . 

where R 4 is H or selected independently among the group consisting 
of d-C 6 alkyl, C2-C 6 alkenyl, C2-C 6 alkynyl, C 3 -C 7 cycloalkyl, C3-C7 cyclohet- 
eroalkyl, aryl, heteroaryl, said group being substituted with 0-3 R 9 and 

25 R 5 is selected independently from -N 3 , -CNO, -C(NOH)NH 2 , -NHOH, 

-NHNHR 6 , -C(0)R 6 , -SnR 6 3 , -B(OR 6 ) 2 , -P(0)(OR 6 ) 2 or the group consisting of 
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C 2 -C 6 alkenyl, C2-C 6 alkynyl, C 4 -C 8 alkadienyl said group being substituted 
with 0-2 R 7 , 

where R 6 is selected independently from H, Ci-C 6 alkyl, C3.C7 cycloal- 
kyl, aryl or Ci-C 6 alkylene-aryl substituted with 0-5 halogen atoms selected 
5 from -F, -CI, -Br, and -I; and R 7 is independently selected from -N0 2f 
-COOR 6 , -COR 6 , -CN, -OSiR 6 3 , -OR 6 and -NR 6 2 . 

R 8 is H, d-C 6 alkyl, C 2 -C 6 alkenyl, C^Ce alkynyl, C3-C7 cycloalkyl, aryl or 
Ci~C 6 alkylene-aryl substituted with 0-3 substituents independently selected 
from -F, -CI, -N0 2 , -R 3 , -OR 3 , -SiR 3 3 
10 R 9 is =0, -F, -CI, -Br, -I, -CN, -N0 2 , -OR 6 , -NR 6 2 , -NR 6 -C(0)R 8 , 

-NR 6 -C(0)OR 8 , -SR 6 , -S(0)R 6 , -S(0) 2 R 6 , -COOR 6 , -C(0)NR 6 2 and 
-S(0) 2 NR 6 2 . 

Cross-link cleavage building blocks 

15 It may be advantageous to split the transfer of a chemical entity to a recipient 
reactive group into two separate steps, namely a cross-linking step and a 
cleavage step because each step can be optimized. A suitable building block 
for this two step process is illustrated below: 



Initally, a reactive group appearing on the functional entity precursor (abbre- 
viated FEP) reacts with a recipient reactive group, e.g. a reactive group ap- 
pearing on a scaffold, thereby forming a cross-link. Subsequently, a cleavage 
25 is performed, usually by adding an aqueous oxidising agent such as l 2 , Br 2 , 



R? 




R 3 



A 



20 
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CI2, H + , or a Lewis acid. The cleavage results in a transfer of the group HZ- 
FEP- to the recipient moiety, such as a scaffold. 

In the above formula 
5 Z is O, S, NR 4 

Q is N, CR 1 

P is a valence bond, O, S, NR 4 , or a group Cs-yarylene, d-ealkylene, 
C-i-eO-alkylene, d-eS-alkylene, NR 1 -alkylene, d-ealkylene-O, d-salkylene-S 
option said group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or C1-C3 
1 0 alkylene-NR 4 2 , d-C 3 alkylene-NR 4 C(0)R 8 . C1-C3 alkylene-NR 4 C(0)OR 8 , 
d-C 2 alkylene-0-NR 4 2 , C1-C2 alkylene-0-NR 4 C(0)R 8 , d-C 2 al- 
kylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 

B is a group comprising D-E-F, in which 

D is a valence bond or a group d-ealkylene, d-6alkenylene, d- 
15 6 alkynylene, C5-7arylene, or Cs^heteroarylene, said group optionally being 
substituted with 1 to 4 group R 11 , 

E is, when present, a valence bond, O, S, NR 4 , or a group d- 
6 alkylene, d-ealkenylene, d-ealkynylene, C^arylene, or Cs-yheteroarylene, 
said group optionally being substituted with 1 to 4 group R 1 \ 
20 F is, when present, a valence bond, O, S, or NR 4 , 

A is a spacing group distancing the chemical structure from the com- 
plementing element, which may be a nucleic acid, 

R\ R 2 , and R 3 are independent of each other selected among the 
group consisting of H, Ci-C 6 alkyl, C2-C 6 alkenyl, C2-C 6 alkynyl, C 4 -C 8 alkadi- 
25 enyl, C3-C7 cycloalkyl, C3-C7 cycloheteroalkyl, aryl, and heteroaryl, said 
group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or d-C 3 al- 
kylene-NR 4 2 , C1-C3 alkylene-NR 4 C(0)R 8 , d-C 3 alkylene-NR 4 C(0)OR 8 , d-C 2 
alkylene-0-NR 4 2l C r C 2 alkylene-0-NR 4 C(0)R 8 , C1-C2 al- 
kylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 
30 FEP is a group selected among the group consisting of H, d-C 6 alkyl, 

C2-C 6 alkenyl, C 2 -C 6 alkynyl, d-C 8 alkadienyl, C 3 -C 7 cycloalkyl, C3-C7 cyclo- 
heteroalkyl, aryl, and heteroaryl, said group being substituted with. 0-3. R 4 , 0-3 
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R 5 and 0-3 R 9 or C r C 3 alkylene-NR 4 2 , C1-C3 alkylene-NR 4 C(0)R 8 , C1.-C3 al- 
kylene-NR 4 C(0)OR 8 , C r C 2 alkylene-0-NR 4 2 , d-C 2 alky!ene-0-NR 4 C(0)R 8 , 
C r C 2 a!kyIene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 

where R 4 is-H or selected independently among the group consisting 
5 of CrC 6 alkyl, C 2 -C 6 alkenyl, C^Ce alkynyl, C 3 -C 7 cycloalkyl, C 3 -C 7 cyclohet- 
eroalkyl, aryl, heteroaryl, said group being substituted with 0-3 R 9 and 

R 5 is selected independently from -N 3 , -CNO, -C(NOH)NH 2 , -NHOH, 
-NHNHR 6 , -C(0)R 6 , -SnR 6 3 , -B(0R 6 ) 2 , -P(0)(OR 6 ) 2 or the group consisting of 
C^Ce alkenyl, C 2 -C 6 alkynyl, C 4 -C 8 alkadienyl said group being substituted 
10 with 0-2 R 7 , 

where R 6 is selected independently from H, CrC 6 alkyl, C^Cy cycloal- 
kyl, aryl or Ci-C 6 alkylene-aryl substituted with 0-5 halogen atoms selected 
from -F, -CI, -Br, and -I; and R 7 is independently selected from -N0 2 , 
-COOR 6 , -COR 6 , -CN, -OSiR 6 3 , -OR 6 and -NR 6 2 . 
15 R 8 is H, Ci-C 6 alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 3 -C 7 cycloalkyl, aryl or 
Ci-C 6 alkylene-aryl substituted with' 0-3 substituents independently selected 
from -F, -CI, -N0 2 , -R 3 , -OR 3 , -SiR 3 3 

R 9 i s = 0l -F, -CI, -Br, -I, -CN, -N0 2 , -OR 6 , -NR 6 2 , -NR 6 -C(0)R 8 , 
-NR 6 -C(0)OR 8 , -SR 6 , -S(0)R 6 , -S(0) 2 R 6 , -COOR 6 , -C(0)NR 6 2 and 
20 -S(0) 2 NR 6 2 . 

In a preferred embodiment Z is O or S, P is a valence bond, Q is CH, B is 
CH 2 , and R 1 , R 2 , and R 3 is H. The bond between the carbonyl group and Z is 
cleavable with aqueous l 2 . 

25 

Partitioning 

The partitioning step, by which the library of bifunctional molecules is sub- 
jected to a condition partitioning one or more complexes having a certain 
property from the remainder of the library, may be referred to as the enrich- 
30 ment step or the selection step, as appropriate, and includes the screening of 
the library for encoded molecules having predetermined desirable character- 
istics. Predetermined desirable characteristics can include binding to a target, 
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catalytically changing the target, chemically reacting with a target in a man- 
ner which alters/modifies the target or the functional activity of the target, and 
covalently attaching to the target as in a suicide inhibitor. 

5 In theory, molecules of interest can be selected based on their properties 
using either physical or physiological procedures. The method preferred ac- 
cording to the present invention is to enrich molecules with respect to binding 
affinity towards a target of interest. In a certain embodiment, the basic steps 
involve mixing the library of complexes with the immobilized target of interest. 

1 0 The target can be attached to a column matrix or microtitre wells with direct 
immobilization or by means of antibody binding or other high-affinity interac- 
tions. In another embodiment, the target and displayed molecules interact 
without immobilisation of the target. Displayed molecules that bind to the tar- 
get will be retained by a filter, size-exclusion chromatography etc, while non- 

15 binding displayed molecules will be removed during a single or a series of 
wash steps. The identifiers of complexes bound to the target can then be 
separated by cleaving a physical connection to the encoded molecule or the 
entire complex may be eluted. It may be considered advantageously to per- 
form a chromatography step after or instead of the washing step. After the 

20 . cleavage of the physical link between .the synthetic molecule and the identi- 
* fier, the identifier may be recovered from the media and optionally amplified 
before the decoding step. 

A significant reduction in background binders may be obtained with increased 
25 washing volumes, repeating washing steps, higher detergent concentrations 
and prolonged incubation during washing. Thus, the more volume and num- 
ber of steps used in the washing procedure together with more stringent con- 
ditions will more efficiently remove non-binders and background binders. The 
right stringency in the washing step can also be used to remove low-affinity 
30 specific binders. However, the washing step will also remove wanted binders 
if too harsh conditions are used. 
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A blocking step, such as incubation of solid phase with skimmed milk pro- 
teins or other inert proteins and/or mild detergent such as Tween-20 and Tri- 
ton X-100, may also be used to reduce the background. The washing condi- 
tions should be as stringent as possible to remove background binding but to 
5 retain specific binders that interact with the immobilized target. Generally, 
washing conditions are adjusted to maintain the desired affinity binders, e.g. 
binders in the micromolar, nanomolar, or pocomolar range. 

The target can be any compound of interest. E.g. the target can be a protein, 
10 peptide, carbohydrate, polysaccharide, glycoprotein, hormone, receptor, an- 
tigen, antibody, virus, substrate, metabolite, transition state analogue, cofac- 
tor, inhibitor, drug, dye, nutrient, growth factor, cell, tissue, etc. without limita- 
tion. Suitable targets include, but are not limited to, angiotensin converting 
enzyme, renin, cyclooxygenase, 5-lipoxygenase, IIL- 1 0 converting enzyme, 
15 cytokine receptors, PDGF receptor, type II inosine monophosphate dehydro- 
genase, p-Iactamases, integrin, proteases like factor Vila, kinases like Bcr- 
Abl/Her, phosphotases like PTP-1B, and fungal cytochrome P-450. Targets 
can include, but are not limited to, bradykinin, neutrophil elastase, the HIV 
proteins, including tat, rev, gag, int, RT, nucleocapsid etc., VEGF, bFGF, 
20 TGFp, KGF, PDGF, GPCR, thrombin, substance P, IgE, sPLA2, red blood 
cells, glioblastomas, fibrin clots, PBMCs, hCG, lectins, selectins, cytokines, 
ICP4, complement proteins, etc. 

A target can also be a surface of a non-biological origin, such as a polymer 
25 surface or a metal surface. The method of the invention may then be used to 
identify suitable coatings for such surfaces. 

In a preferred embodiment, the desirable synthetic molecule acts on the tar- 
get without any interaction between the nucleic acid attached to the desirable 
30 encoded molecule and the target. In one embodiment, the bound complex- 
target aggregate can be partitioned from unbound complexes by a number of 
methods. The methods include nitrocellulose filter binding, column chroma- 



WO 2005/026387 PCT/DK2004/000630 

42 

tography, filtration, affinity chromatography, centrifugation, and other well 
- known methods. A preferred method is size-exclusion chromatography. 

Briefly, the library of complexes is subjected to the target, which may include 
5 contact between the library and a column onto which the target is immobi- 
lised. Identifiers associated with undesirable encoded molecules, i.e. syn- 
thetic molecules not bound to the target under the stringency conditions 
used, will pass through the column. Additional undesirable encoded mole- 
cules (e.g. encoded molecules which cross-react with other targets) may be 

10 removed by counter-selection methods. Desirable complexes are bound to 
the column. The target may be immobilized in a number of ways. In one em- 
bodiment, the target is immobilized through a cleavable physical link, such as 
one more chemical bonds. The aggregate of the target and the complex may 
then be subjected to a size exclusion chromatography to separate the aggre- 

15 gate from the rest of the compounds in the media. The complex may then be 
eluted from the target by changing the conditions (e.g., salt, pH, surfactant, 
temperature etc.). Alternatively, the complex may be provided with a cleav- 
able linker, preferable orthogonal to a cleavable linker that attaches the target 
to the solid support, at a position between the synthetic molecule and the 

20 identifier. Subsequent to the size exclusion chromatography this cleavable 
linker is cleaved to separate the identifiers of complexes having affinity to- 
wards the targets. Just to mention a single type of orthogonal cleavable link- 
ages, one could attach the target to the solid support through a linkage that 
can be cleaved by a chemical agent, and the linker separating the synthetic 

25 molecule and the identifier may be selected as a photocleavable linkage. 
More specifically, the former linkage may be a disulphide bond that can be 
cleaved by a suitable reducing agent like DTT (dithiothreitol) and the latter 
linkage may be a o-nitrophenyl group. 

30 There are other partitioning and screening processes which are compatible 
with this invention that are known to one of ordinary skill in the art. In one 
embodiment, the products can be fractionated by a number of common 
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methods and then each fraction is assayed for activity. The fractionization 
methods can include size, pH, hydrophobicity, etc. 

Inherent in the present method is the selection of encoded molecules on the 
5 basis of a desired function; this can be extended to the selection of mole- 
cules with a desired function and specificity. Specificity can be required dur- 
ing the selection process by first extracting complexes which are capable of 
interacting with a non-desired "target" (negative selection, or counter- 
selection), followed by positive selection with the desired target. As an exam- 

10 pie, inhibitors of fungal cytochrome P-450 (fungicides) are known to cross- 
react to some extent with mammalian cytochrome P-450 (resulting in serious 
side effects). Highly specific inhibitors of the fungal cytochrome could be se- 
lected from a library by first removing those complexes capable of interacting 
with the mammalian cytochrome, followed by retention of the remaining 

15 products which are capable of interacting with the fungal cytochrome. 

Brief Description of the Figures 

Fig. 1 1 discloses two embodiments of using a Taqman probe (5' nuclease 
probe) in the measurement of the presence or absence of a certain codon. 
20 Fig. 12 discloses a standard curve used in example 3. 

Fig. 13 discloses the result of two experiments reported in example 3. 

Example 3 - illustrating the first aspect of the present invention 

A preferred embodiment of the invention utilizing a universal Taqman probe 
25 is shown in Fig.1 1. Four codons are shown (P1 through P4; bold pattern) 

along with flanking regions (light pattern). A universal Taqman probe anneals 
to a region adjacent to the codon region, but within the amplicon defined by 
the universal PCR primers Pr.1 and Pr. 2. These primers could be the same 
as used for amplification of the identifier oligonucleotides encoding binders 
30 after an enrichment process on a specific target. However, if minimal length 
templates are preferred during the encoding process, the region involved in 
Taqman probe annealing could be appended to the library identifier oligonu- 
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cleotides by e.g. overlap PCR, ligation, or by employing a long downstream 
PCR primer containing the necessary sequences. The added length corre- 
sponding to the region necessary for annealing of the Taqman probe would 
be form 20 to 40 nts depending on the type of TaqMan probe and T A of the 
5 PCR primers. The Q-PCR reactions are preferably performed in a 96- or 384- 
well format on a real-time PCR thermocycling machine. 

Fig. 1 1 A shows the detection of abundance of a specific codon sequence in 
position one. Similar primers are prepared for all codon sequences. For each 

10 codon sequence utilized to encode a specific BB in the library a Q-PCR reac- 
tion is performed with a primer oligonucleotide complementary to the codon 
sequence in question. A downstream universal reverse primer Pr. 2 is pro- 
vided after the Taqman probe to provide for an exponential amplification of * 
the PCR amplicon. The setup is most suited for cases where the codon con- 

15 stitutes a length corresponding to a length suitable for a PCR primer. 

Fig. 1 1 B shows the detection of abundance of a specific codon sequence in a 
specific codon position using a primer which is complementing a codon and a 
framing sequence. Similar primers are used for all the codons and framing 

20 sequences. For each codon sequence utilized to encode a specific BB at a 
specific codon position in the library a Q-PCR reaction is performed with an 
oligo complementary to the codon sequence in question as well as a short 
region up- or downstream of the codon region which ensures extension of the 
primer in a PCR reaction only when annealed to the codon sequence in that 

25 specific codon position. The number of specific primers and Q-PCR reactions 
needed to cover all codon sequences in all possible codon positions equals 
the number of codon sequences times the number of codon positions. Thus, 
monitoring the abundance of 96 different codon sequences in 4 different po- 
sitions can be performed in a single run on four 96 wells micro titre plates (as 

30 shown in Fig. 1 1B) or a single 384 well plate on a suitable instrument. This 
architecture allows for the decoding of a 8,5 x 10 7 library of different encoded 
molecules. 
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Quantification is performed relative to the amount of full-length PCR product 
. obtained in a parallel control reaction on the same input material performed 
with the two external PCR primers Pr.1 + Pr. 2. Theoretically, a similar rate of 
5 accumulation of this control amplicon compared to the accumulation of a 
product utilizing a single codon + sequence specific primer would indicate a 
1 00% dominance of this particular sequence in the position in question. 

Although the setups shown in Fig. 11A and 1 1B employ a Taqman probe 
10 strategy, other detection systems (SYBR green, Molecular Beacons etc.) 
could be utilized. In theory, multiplex reactions employing up to 4 different 
fluorofors in the same reaction could increase throughput correspondingly. 

An example of how a deconvolution process of a library of encoded mole- 

15 cules occurs is described in the following. Imagine that at the end of a selec- 
tion scheme a pool of 3 ligand families (and the corresponding coding tem- 
plates) are dominating the population and present at approx. the same con- 
centration. Three different chemical entities are present in the first position of 
the encoded compounds, and each of these chemical entities are present in 

20 combination with one unique chemical entity out of 3 different chemical enti- 
ties in position P2. Only one chemical entity in position 3 gives rise to active 
binders, whereas any of a 20% subset of chemical entities (e.g. determined 
by charge, size or other characteristica) are present in position 4. The out- 
come of the initial codon profile analysis would be: 3 codon sequences are 

25 equally dominating in position P1, 3 other codon sequences in position P2, 1 
unique codon sequence is dominant in P3 whereas somewhat similarly in- 
creased levels of 20% of the codon sequences (background levels of the re- 
maining 80% sequences) are seen in P4. In such cases it could be relevant 
to use an iterative Q-PCR ("IQ-PCR") strategy to perform a further deconvo- 

30 lution of a library after selection. Again with reference to the example above, 
by taking the PCR products from the 3 individual wells that contained primers 
giving the high yields in position P1, diluting the product appropriately and 
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performing a second round of Q-PCR on each of these identifier oligonucleo 
tides separately, it would be possible to deduce which codon sequence(s) is 
preferred in P2 when a given codon sequence is present in P1 . 
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Synthesis of identifier oligonucleotides: 

The 10 identifier oligonucleotides were assembled in 10 seperate 50 ul PCR 
reactions each containing 0.05 pmol of the oligos Q-Temp1-X, Q-Temp2, Q- 
Temp3-X and Q-Temp4 (x=1 through 10) and 25 pmol of the external prim- 
5 ers FPv2 and RPv2 with TA= 53°C. The 1 60 bp products were gel-purified 
using QIAquick Gel Extraction Kit from QIAGEN (Cat. No. 28706) and quanti- 
fied on spectrophotometer. As a control, 20 ng of each of the templates (as 
estimated from these measurements) were loaded on an agarose gel. 

10 Preparation of samples for Q-PCR: 

Sample A: Generated by mixing 20 ng from each identifier oligonucleotide 
prep. Volume was adjusted to 50 ul. Concentration: 4 ng/ul = 38.46 fmol/uJ 
(160bp x 650 Da/bp =1.04x105 g/mol. 1 ng= 9.615 fmol). Diluted to 10 7 cop- 
ies/5ul (0.00332 fmol/ul). 

15 

Sample B: 20 ng/20ul stocks of each template were prepared. The sample 

was mixed as follows: 

5ul undil. Template #10 

5ul 2x dil. Template #9 
20 5ul 4x dil. Template #8 

5ul 8x dil. Template #7 

5ul 16xdil. Template #6 

5ul 32x dil. Template #5 

5ul 64x dil. Template #4 
25 5ul 128x dil. Template #3 

5ul 256x dil. Template #2 

5ul 51 2x dil. Template #1 

Concentration: 10ng/50ul= 0.20 ng/pl = 1.923 fmol/ul. Diluted 579.2-fold to 
10 7 copies/5pl (0.00332 fmol/ul). 

30 



Standard curve : The samples for the standard curve was prepared by diluting 
Sample A 1 16.55-fold to 10 9 copies/5 pi (0.33 fmol//pl) and subsequently per- 
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forming a 10-fold serial dilution of this sample. 5 pi was used for each PCR 
reaction. The standard curve is shown in Fig. 12. 

Q-PCR reactions 
5 For 5 ml premix (for one 96-well plate): 

2.5 ml Taqman Universal PCR Master Mix (Applied Biosystems; includes Taq 
polymerase, dNTPs and optimized Taq pol. buffer) 
450 pi RPv2(10pmol/ul) 

25 pi Taqman probe (6-FAM-TCCAGCTTCTAGGAAGAC-MGBNFQ; 50 pM; 
10 Applied Biosystems) 
1075 pi H20 

40.5 pi premix was aliquoted into each well and 4.5 pi of relevant upstream 
PCR primer (FPv2 (for standard curve) or one of the codon specific primers 
15 listed below; 10 pmol/pl) and 5 pi sample (H20 in wells for negative controls) 
was added. The codon-specific PCR primers were: (Tm calculations shown 
are from Vector NTI; matched to Tm for RPv2 (67.7°C)) 

P1-1: GTCATACTAGCTG CTAG AG ATGTGGTG ATA 66.8°C 

20 P1-2: CATACGGAAGAAGACAGAAGACCTGATA 67.8°C 

P1-3: TCATACTCAGGAGTCGAGAACTGAAGATA 67.6°C 

P1-4: CATACTGTGTACGTCAACACGTCAGATA 67.4°C 

P1-5: CATACTGTGGAACTACCATCCAAGGATA 68.0°C 

P1-6: CCATCCAACATCGTTGGAAGAT 67.8°C 

25 P1-7: CATACAACCTGTCCTGTGAGATCTGATA 67.7°C 

P1-8: ATACTCACGAAGCTGGATGATGAGATA 67.3°C 

P1-9: CATACTAGCATCGATCGAACGTAGGATA 68.1°C 

P1-10: TCATACTCGAAGCTACTGTCGAGATGATA 68.2°C 

P2-1: ATATTAGTGTGTGACGATGGTACGCA 67.8°C 

30 P3-1: ACAAGTACGAACGTGCATCAGAGA 67.7°C 
P4-1: CGAGCAGGACCTGGAACCT 67.7°C 

P4-2: TCGACCACTGCAGGTGGA 68.3°C 
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P4-3: 


GCTTCCTCTGCTGCACCA 


l 


P4-4: 


GGTGTCGAGGTGAGCAGCA 


69.1°C 


P4-5: 


CGACGAGGTCCATCCTGGT 


68.6°G 


P4-6: 


GTGAGGAGCAGGTCCTCCTGT 


68.0°C 


P4-7: 


CTG ACACTGGTGGTG GTCG A 


68.8°C 


P4-8: 


CATCTCGACGACCTGCTCCT 


67.9°C 


P4-9: 


ACGAGGTCTCCACTGGTCCA 


68.3°C 


P4-10: 


ACTGAGCTGCTCCTCCAGGT 


66.5°C 



10 Thermocycling/measurement of fluoresence was performed on an Applied 

Biosystems ABI Prism 7900HT real-time instrument utilizing the standard 

cycling parameters: 

95°C10min; 

40 cycles of 
15 95°C15sec; 

60°C 1 min 



All samples were run in duplicate. 
20 Results 

Fig. 12 shows the standard curve calculated by the 7900HT system software. 
The log of the starting copy number was plotted against the measured C T 
value. The relationship between Ct and starting copy number was linear in 
the range from 10 to 10 9 template copies. 

25 

This standard curve was utilized by the system software to calculate the 
quantity in the "unknown" samples as shown below. 



WO 2005/026387 



PCT/DK2004/000630 



Table 1: Sample A (Shown 
Sample 
A: Equi- 



molar 




ratios 


Observed A 


FPv2 


12539947,00 


P1-1 


445841,90 


P1-2 


884840,70 


P1-3 


1013073,56 


P1-4 


. 764187,94 


P1-5 


1352874,60 


P1-6 


1284075,60 


P1-7 


658161,80 


P1-8 


742187,20 


P1-9 


824587,75 


P1-10 


813550,75 


P2-1 


13145159,00 


P3-1 


13263911,00 


P4-1 


1430704,80 


P4-2 


2681652,00 


P4-3 


1933106,80 


P4-4 


1359684,40 


P4-5 


2206709,80 


P4-6 


1652718,10 


P4-7 


1468208,10 


P4-8 


1664467,50 


P4-9 


1462520,60 


P4-10 


2020088,20 
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graphically in Fig. 13A 



Observed B 


Expected 


11977503,00 


10000000 


480382,03 


1000000 


847478,56 


1000000 


948770,00 


1000000 


741304,40 


1000000 


1275155,50 


1000000 


1337928,50 


1000000 


747371,56 


1000000 


653874,00 


1000000 


705785,75 


1000000 


836037,90 


1000000 


14482606,00 


10000000 


12773780,00 


10000000 


1472576,80 


1000000 


2481824,80 


1000000 


2085476,40 


1000000 


1364621,40 


1000000 


2065813,60 


1000000 


1873777,20 


1000000 


1416153,00 


1000000 


1581067,00 


1000000 


1594593,80 


1000000 


1912277,40 


1000000 



Table II: Sample B (Shown graphically in Fig. 13B) 
Sample Observed 
B: 2-fold Observed A B Expected 
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dil. 




FPv2 


4.97E+06 


P1-1 


9955,07 


P1-2 


12732,32 


P1-3 


25542,8 


P1-4 


34748,89 


P1-5 


110881,41 


P1-6 


163687,44 


P1-7 


156993,81 


P1-8 


343176,78 


P1-9 


646619,44 


P1-10 


1.49E+06 


P2-1 


5.19E+06 


P3-1 


5.29E+06 


P4-1 


(no signal) 


P4-2 


42103,32 


P4-3 


54480,62 


P4-4 


51293,07 


P4-5 


137946,95 


P4-6 


174134,64 


P4-7 


316505,78 


P4-8 


737661,44 


P4-9 


1.42E+06 


P4-10 


3.72E+06 



5.05E+06 


10000000 


10899,97 


9765,625 


13469,12 


19531,25 


25419,85 


39062,5 


44070,81 


78125 


123734,13 


156250 


166220,5 


312500 


172005,64 


625000 


374809,13 


1250000 


576151 


2500000 


1.72E+06 


5000000 


5.37E+06 


10000000 


5.09E+06 


10000000 


70223,8 


9765,625 


22733,17 


19531,25 


39663,62 


39062,5 


43950,9 


78125 


115027,34 


156250 


156442,55 


312500 


283856,84 


625000 


691296,75 


1250000 


1.45E+06 


2500000 


3.52E+06 


5000000 



The results of the experiments show the possibility of accurately quantifica- 
tion of identifier oligonucleotides down to or even below 10 copies with a 9 
5 fold dynamic range, and reliable relative quantification of the tested codons in 
various positions in the identifier oligonucleotide. 
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While the invention has been described with references to specific methods 
and embodiments, it will be appreciated that various modifications and 
changes may be made without departing from the invention. All patent and 
literature references cited herein are hereby incorporated by reference in 
5 their entirety. 

The following items describe embodiments of the first aspect of the present 
invention: 

10 Item 1 : A method for obtaining structural information about an encoded mole- 
cule produced by a process comprising reaction of a plurality of chemical en- 
tities, said encoded molecule being capable of forming part of a complex also 
comprising an identifier oligonucleotide containing codons informative of the 
identity of chemical entities which have participated in the formation of the 

15 encoded molecule, the method comprises 

a) mixing a primer oligonucleotide with the identifier oligonu- 
cleotide, 

b) subjecting the mixture to a condition allowing for an exten- 
sion reaction to occur when the primer is sufficient complemen- 

20 tary to a part of the identifier oligonucleotide, and 

c) evaluating, based on measurement of the extension reaction, 
the presence, absence, or relative abundance of one or more 
codons. 

25 Item 2: The method according to Item 1 , wherein a composition of one, two, 
or more identifier oligonucleotides are processed simultaneously. 

Item 3: The method according to Item 2, the composition is a result of sub- 
jecting a library of different complexes to a condition partitioning one or more 
30 complexes having a certain property from the remainder of the library. 
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Item 4: The method according to Item 1 , wherein the condition which allows 
for an extension reaction to occur includes a polymerase or a ligase as well 
as suitable substrates. 

5 Item 5: The method according to Item 4, wherein the condition includes a 
polymerase and a substrate comprising a blend of (deoxy)ribonucleotide 
triphosphates. 

Item 6: The method according to any of the preceding Items, wherein the 
10 chemical entities are precursors for structural units appearing in the encoded 
molecule. 

Item 7: The method according to any of the Items 1 to 6, wherein the process 
of producing the one or more encoded molecules comprises transferring one 
15 or more chemical entities to a nascent encoded molecule by a building block 
which further comprises an anti-codon. 

Item 8: The method of Item 7, wherein the information of the anti-codon is 
transferred in conjunction with the chemical entity to the nascent encoded 
20 molecule. 

Item 9: The method according to any of the preceding Items 1 to 8, wherein 
the identifier comprises two or more codons. 

25 Item 1 0: The method according to any of the preceding Items 1 to 8, wherein 
the identifier comprises three or more codons. 

Item 11: The method according to any of the preceding Items, wherein 
neighbouring codons of the identifier are spaced by a framing sequence. 

30 
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Item 12: The method according to Item 1 1, wherein the framing sequence 
positions the reaction of a chemical entity in the synthesis history of the en- 
coded molecule. 

5 Item 13: The method according to any of the Items 1 to 12, wherein at least a 
part of the primer is complementary to a codon. 

Item 14: The method according to any of Items 1 to 13, wherein at least a 
part of the primer is complementary to a codon and an adjacent framing se- 
10 qunece. 

Item 15: The method according to any of the Items 1 to 13, wherein the 
codons have a length of four or more nucleotides. 

15 Item 16: The method according to any of the Items 1 to 15, wherein the se- 
quence comprising the codon and an adjacent framing sequence has a total, 
length of 1 1 nucleotides or more. 

Item 17: The method according to any of the Items 1 to 16, wherein the ex- 
20 tension reaction is measured using the polymerase chain reaction (PCR), 
wherein the primer of Item 1 is involved in said PCR. 

Item 18: The method according to any of the Items 1 to 17, wherein a primer 
is labelled. 

25 

Item 19: The method according to Item 18, wherein the primer is labelled with 
a small molecule, a radio active component, or a fluorogenic molecule. 

Item 20: The method according to Item 19, wherein the small molecule label 
30 . is selected from biotin, dinitrophenol, and digoxigenin, and the PCR ampli- 
cons are detected using an enzyme labelled streptavidin, anti-dinitrophenol. 
or anti-digoxigenin, respectively, reporter molecule. 



WO 2005/026387 PCT/DK2004/000630 

57 

Item 21: The method according to any of the Items 1 to .19, wherein exten- 
sion reaction is measured by real-time PGR. 

Item 22: The method according to Item 21, wherein the real-time PCR in- 
.5 volves the use of an oligonucleotide probe responsible for the generation of a 
detectable signal during the propagation of the PCR reaction. 

Item 23: The method according to any of the Items 1 to 21, wherein the probe 
is designed to hybridise at a position downstream of a primer binding site. 

10 

Item 24: The method according to Item 22 or 23, wherein the probe is a 5' 
nuclease oligoprobe or a hairpin oligoprobe. 

Item 25: The method according to Items 2 or 3, wherein the library comprises 
15 complexes with identifier oligonucleotides having n codon positions and the 
codons in said codon positions being selected from a set of m different 
codons. 

Item 26: The method according to Item 25, wherein a framing sequence is 
20 related to each of the n codon positions in a particular complex, said framing 
sequence positions the reaction of a chemical entity in the synthesis history 
of the encoded molecule. 

Item 27: The method according to Item 25, wherein each codon in the set of 
25 m different codons differs from any other codons in the set in at least two 
nucleotide positions. 

Item 28: The method according to Item 26, wherein each framing sequence 
in a set of n different framing sequences differs from any other framing se- 
30 quences in the set in at least two nucleotide positions. 
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Item 29: A method for identifying the chemical entities utilized in the forma- 
tion of an encoded molecule or a composition of encoded molecules, wherein 
in separate compartments, n x m primers individually are mixed with an ali- 
quot of a composition obtained by subjecting a library of different complexes 
5 to a condition partitioning said composition from the remainder of the library, 
subjected to a mixture of polymerase and substrate (deoxy)ribonucleotide 
triphosphates under conditions allowing for an extension reaction to occur 
when a primer is sufficient complementary to a part of one or more identifier 
oligonucleotides present in the aliquot, and evaluation, based on measure- 
10 ment of the extension reaction, the presence, absence, or relative abundance 
of one or more codons in each compartment 

Item 30: A set comprising a collection of oligonucleotide primers, a poly- 
merase, a composition of (deoxy)ribonucleotide triphosphates (dNTPs), and 

15 a library of complexes composed of a display molecule part and an identifier 
oligonucleotide, said oligonucleotide comprising codons informative of the 
identity of the chemical entities which has participated in the formation of the 
display molecule, wherein the oligonucleotide primers are sufficient comple- 
mentary to codons appearing on the identifier oligo nucleotides in the library 

20 to allow for an extension reaction to occur. 

Item 31: An encoded molecule identified by a method according to Items 1 to 
28. 

25 Method for identifying a display molecule 

The following pages will describe the second aspect of the invention: A 
METHOD FOR IDENTIFYING A DISPLAY MOLECULE, in which various 
patent and non-patent references cited in the present application are hereby 
incorporated by reference in their entirety. It is envisaged that any of the em- 
30 bodiments of the first aspect of the present invention may be used in combi- 
nation with any of the embodiments of the second aspect of the present in- 
vention; indeed any of the features described in relation to the first aspect of 
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the present invention may be used in combination with any of the embodi- 
ments or features of the second aspect of the present invention, and vice 
versa. 

5 The second aspect of the present invention relates to a method for identifying 
from a library a display molecule having affinity towards one or more 
molecular targets. The display molecule is a part of a complex also 
comprising an identifier oligonucleotide that codes for said display molecule. 

10 Traditional drug discovery begins with a pathological phenomenon in an or- 
ganism and the development of a therapeutic theory to combat this. A chemi- 
cal concept follows to produce compounds for screening. Most of the proc- 
esses for curing the pathological phenomenon originate with the understand- 
ing of some biological pathways and screening for an effect in tissues or 

15 cells. This may or may not eventually reveal a "target". The target can be 

identified by various conventional methods, including protein expressing, pro- 
tein chemistry, structure-functional studies, knowledge of biochemical path- 
ways, and genetic studies. 

20 In recent years, genetic information has increasingly guided the identification 
of molecular targets. These are derived from the knowledge of the genes of 
specific cell phenotypes that encode proteins that may be involved in the 
pathogenesis of a particular disease state. 

25 A lead is a compound, usually a small organic molecule that demonstrates a 
desired biological activity on a target. Usually, a collection of compounds, 
referred to as a "library", is screened before a useful lead is identified. Today, 
many libraries are commercially available or open to public. Most pharma- 
ceutical companies house their own compilation of compounds that have 

30 been synthesised over several years and screened against a variety of tar- 
gets. 
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Each compound in a library must be screened by an appropriate assay 
against the target. Techniques for handling the screening of several thou- 
sands compounds simultaneously have been developed and are generally 
referred to as high-throughput screening techniques. To push the limit of 
5 compounds possible toscreen simultaneously, different manufactures have 
been developing instrumentation capable of handling multiple micro titer plate 
formats on the same platform using 384 and 1536-well plates. Advances in 
small volume liquid dispensing and pipetting, reliable handling of standard- 
ized plates and simplified assay formats all have made an impact on the reli- 
10 ability of the high-throughput screening process. 

However, high-throughput screening has the disadvantage that each of the 
compounds has to be positioned in spatially discrete regions, usually in wells 
of a micro titer plate in order to observe an interaction with a target. If more 
15 than a single compound is present, it is not feasible to discern which com- 
pound displaying the appropriate biological activity. Thus, the full power of 
combinatorial chemistry cannot be applied because a collection of com- 
pounds usually is produced in a single container. 

20 To be able to select a possible lead compound in a collection of compounds 
placed in the same container, libraries of bifunctional complexes have been 
evolved. Each bifunctional complex in the library comprises a potential lead 
compound coupled to an identifier oligonucleotide sequence. The identifier 
oligonucleotide sequence is suitably a nucleic acid which identifies the poten- 

25 tial lead compound. When a library of bifunctional complexes is screened 
against a target, one or more of the potential leads may bind to the target. 
After removal of the remainder of the library, the binding bifunctional com- 
plexes can be eluated and the lead compound identified by sequencing the 
identifier oligonucleotide. 

30 

Various techniques for producing bifunctional complexes are known from the 
prior art. Some attempts to form the complex comprising a molecule as well 



WO 2005/026387 PCT/DK2004/000630 

61 

as the identifier oligonucleotide that codes therefore, are based on the split- 
and-mix principle known from combinatorial chemistry, see e.g. WO 
93/06121 A1 , EP 643 778 B1 , and WO 00/23458. 

5 Other attempts have focussed on the formation of encoded proteins using the 
natural machinery of a cell and connecting the formed protein with the tem- 
plate nucleic acid that has coded for the amino, acid components of the pro- 
tein. Examples of suitable systems are phage display, E. coli display, ribo- 
some display (WO 93/03172), and protein-mRNA-fusions (WO 98/31700). 

10 The genetic information of the nucleic acid, usually mRNA or DNA, may not 
necessarily be decoded between each round of selection to establish the 
identity of the chemical entities that has formed the protein because the nu- 
cleic acid can be amplified by known means, such as PCR, and processed 
for the formation of a new library enriched in respect of suitable binding pro- 

15 teins. 

Recently, a method for encoding molecules has been suggested, which can 
be performed in several selection rounds without intermediate decoding, 
wherein the encoded molecule is not restricted to peptides and proteins. WO 

20 02/00419 and WO 02/103008 disclose methods for preparing virtually any 
molecule connected to an identifier oligonucleotide coding for chemical enti- 
ties which have reacted to form the display molecule. In short, a template 
segregated into a plurality of codons and a plurality of building blocks com- 
prising a transferable chemical entity and an anticodon are initially provided. 

25 Under hybridisation conditions, the template and building blocks are an- 
nealed together and the chemical entities are subsequently reacted to form 
the molecule. 

The present invention aims at providing an efficient method for identifying 
30 display molecules having affinity towards a target using a library of Afunc- 
tional complexes. 
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The second aspect of the present invention concerns a method for identifying 
display molecule(s) having affinity towards molecular target(s), comprising 
the steps of 

. mixing one or more molecular target(s) associated with target oligonu- 
5 clebtide(s) and a library of bifunctional complexes, each bifunctional complex 
of the library comprising a display molecule attached to an identifier oligonu- 
cleotide, which codes for said display molecule, 

coupling to the target oligoriucleotide(s) the identifier oligonucleotide 
of complexes comprising display molecules binding to the target, and 
10 deducing the identity of the binding display molecule(s) and/or the mo- 

lecular target(s) from the coupled product between the identifier oligonucleo- 
tide^) and the target oligonucleotide(s). 

The second aspect of the present invention is based on the realization that a 

1 5 close proximity of the identifier oligonucleotides relative to the target oligonu- 
cleotide is obtained when a display molecule has binding affinity towards a 
target and therefore, will be more prone to be coupled to the target oligonu- 
cleotide compared to identifier oligonucleotides of complexes not comprising 
a display molecule having affinity toward the target. The tendency to be cou- 

20 pled together depends on various factors such as (1) the affinity of the dis- 
played molecule towards the target, (2) the length between the displayed 
molecule and the coupling point on the identifier oligonucleotide sequence as 
well as the length between the target molecule and the coupling point on the 
target sequence, and (3) possible steric effects resulting from the nature of 

25 the target or the oligonucleotides. The proximity effect, and thus the power of 
selectivity, will increase with higher local concentration of ends of nucleotides 
to be coupled together. Thus, various embodiments of the present invention 
may be envisaged to fine-tune the local concentration of the ends of the oli- 
gonucleotides, including appropriately selected lengths of target and identifier 

30 oligonucleotides, site of attachment of the target oligonucleotide to the target 
relative to the binding site of the display molecule, and size of target. 
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The molecular target may be of a biological origin or may be a synthetic mo- 
lecular target. Typically, the molecular target stems from an organism se- 
lected from human and animals, especially vertebras. However, in other em- 
bodiments the target may originate from a plant. In the quest for a compound 
5 with therapeutical effect on the human or animal body, the target is usually 
expected to have an importance in a therapeutically theory that combats a 
certain disease. In the quest for discovering compounds with plant protective 
effect, the target is usually expected to originate from an organism that harms 
the crop or a competing undesired plant. The organism may be a fungus 
10 when a compound with fungicide effect is searched for or an insect when a 
compound having insecticide effect is desired. Optionally, a protein target 
stemming from a biological origin may be derivatised by altering, adding, or 
deleting one or more amino acids.. 

15 The molecular target may be a protein, a small molecular hormone, a lipid, a 
polysaccharide, a whole celJ, a nucleic acid, a metabolite, a heme group, etc. 
In a preferred aspect the target is a protein. The protein may serve the func- 
tion in the organism of being an enzyme, a hormone, a structural element, a 
regulatory protein, a membrane channel or pump, a part of a signal transduc- 

20 ing cascade, an antibody, etc. Suitable target enzymes include kinases, 

phosphafates, and proteases. The protein may occur as an independent en- 
tity or may be dimers, trimers, tetramers, or polymers and the protein may 
comprise a prosthetic group. Also, the target may be a fusion protein having 
two or more functionalities. Furthermore, the molecular target may be a solu- 

25 ble or insoluble agglomerate of one or more proteins and one or more sub- 
stituents occurring in the body or artificial components. In another preferred 
embodiment, the molecular target is a nucleic acid, such as DNA or RNA ap- 
tamer or ribozyme. 

30 The target may be immobilized to a support or be present as a solution or a 
emulsion, as appropriate. The target optionally immobilized on the support, 
may also form a stable or quasi-stable dispersion in the media. In a certain 
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embodiment, the target is in solution and all the reactions occur in the solu- 
tion too. The absence of an immobilization step, reduces the background 
noise because there is no background surface to associate to. Thus the re- 
sult of the assay may be more sensitive. In solution, the only background 
5 noise imaginable is when the oligonucleotides or display molecules of the 
library of complexes binds unspecific to the target molecules or the target 
oligonucleotides. The absence of an immobilization step generally necessi- 
tates a subsequent recovery step, eg. by chromatography. 

10 In certain aspects of the invention, it is preferred to immobilize the molecular 
target on a solid support. The solid support may be beads of a column or the 
surface of a container. The immobilisation of the molecular target may ease 
the removal of the non-binding complexes by washing or similar means. In a 
certain embodiment, a cleavable linkage between the molecular target and 

15 the solid support is present. The cleavable linker is preferably selectively 

cleavable, that is, the linkage can be cleaved without cleaving other linkages 
in the target or the complexes. The cleavage of the linkage between the mo- 
lecular target and the solid support may reduce the contribution from the 
background, such as complexes associated with the surface of the solid sup- 

20 port and not binding to the molecular target. 

A single or a multitude of different molecular targets may be mixed with the 
library of complexes. If two or more different targets are mixed with the library 
of complexes it is appropriate to provide the molecular targets with a genetic 

25 sequence coding for the identity of the target in question. Proving the targets 
with identifying oligonucleotides allows for a simultaneous decoding of the 
binding partners, i.e. the molecular target and the display molecule. The si- 
multaneous decoding is not only suitable for finding binding partners. It is 
also valuable for finding a possible cross-binding interaction or to find other 

30 display molecules competing for the same target. Furthermore, appropriate 
selection of display molecules and/or molecular targets, can generate useful 
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information for preparing a structure-activity relationship (usually abbreviated 
SAR). 

The molecular target may be obtained in any suitable way. A variety of tar- 
5 gets are commercially available, either as purified protein or as the corre- 
sponding cDNA. Other protein or peptide targets may be isolated from tis- 
sues or mRNA (or the corresponding cDNA) may be extracted from a tissue. 
Smaller peptides may be synthesised chemically using the standard solid- 
phase Fmoc peptide synthesis. When nucleic acids are used or included in 
10 the molecular target, it may be synthesised using the standard amedite syn- 
thesis method or by using the natural machinery. 

The target can be associated with the target oligonucleotide using any suit- 
able means and the association may involve a covalent or non-covalent link- 

15 age. In an aspect of the invention the oligonucleotide is associated with the 
target utilizing a chemical synthesis. A protein usually comprises several 
groups that may be functionalized and used as attaching point. As examples 
the side chain of lysine contains an amino group, the side chain of serine 
contains a hydroxyl group and the side chain of cystein contains a thiol 

20 group, all of which may serve as anchoring point for a target oligonucleotide 
comprising e.g. a carboxylic acid group. In another aspect of the invention, 
the protein target is fused to a tag, such as a His-tag, Flag-tag, antibody, or 
streptavidin. The tag can be selectively recognized by an anti-body or small 
molecule such as biotin or dinitrophenol. The anti-body or the small molecule 

25 is attached to the target oligonucleotide, thereby ensuring an efficient cou- 
pling of the molecular target to the target oligonucleotide. 

The target oligonucleotide can be associated to the molecular target through 
a cleavable linker. The cleavable linkage can be used to separate the target 
30 oligonucleotide from the molecular target or the coupled product from the 
target at a point in time following the contacting between the target and li- 
brary of complexes. The target oligonucleotide can be distanced from the 
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molecular target by a suitable linker, such as a polyethylene linker or a non- 
coding nucleotide sequence. A linker may remedy any interaction that possi- 
ble can be between the molecular target and the target oligonucleotide and at 
the same time provide suffice space for an enzyme to perform its action. 

5 

A variety or methods for association of an oligonucleotide to a target is avail- 
able for the skilled person in the art. An option involves the association of a 
target protein with the mRNA responsible for the formation thereof. This 
method is generally referred to as mRNA display. Optionally, the mRNA may 

10 be substituted with the corresponding cDNA. A method for generation such a 
single or a library of fusions between a protein and the mRNA responsible for 
the formation thereof is disclosed in WO 98/31700. The corresponding DNA 
strand may be attached to the protein using the method disclosed in WO 
00/32823. The contents of both patent applications being incorporated in 

15 their entirety by reference herein. The method of WO 98/31700 includes pro- 
viding a RNA stand comprising a translation initiation sequence, a start 
codon operable linked to a protein encoding sequence, and a peptide accep- 
tor at the 3' end and translating the protein encoding sequence to produce a 
RNA-protein fusion. According to WO 00/32823 a DNA primer is covalently 

20 connected to the 3' end of the mRNA strand and extended by reverse tran- 
scriptase a to prepare the complementing DNA strand. The original RNA 
strand may be digested by RNase H. Another suitable method for generating 
a target library is disclosed in WO 01/90414, the content of which is incorpo- 
rated herein by reference. 

25 

In accordance with another option, the target oligonucleotide is associated 
with a target using a method generally referred to as ribosome display. Ribo- 
some display is disclosed in WO 93/03172, the content of which is included 
herein by reference. A still further option for association is a variation of the 
30 ■ phage display, in which a target is displayed on the capsule of the phage and 
a target oligonucleotide is connected to the same capsule. Suitably, the tar- 
get oligonucleotide is connected to the capsule via reactive groups positioned 
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on proteins expressed on the capsule. A suitable reactive group is -SH ema- 
nating from cystein. 

A further option for associating the target with an oligonucleotide includes the 
5 method disclosed in M. Yonezawa et at, Nucleic acid research, 2003, vol. 31 , 
No. 19 e118 (included by reference). The method includes the initial provi- 
sion of a target oligonucleotide connected to biotin and compartmentalization 
thereof together with a transcription and translation system. The target oli- 
gonucleotide comprises a fusion gene coding for streptavidin and a target. 
1 0 After the formation of the fusion protein in each compartment, the streptavidin 
part of the fusion protein binds to the biotin moiety of the target oligonucleo- 
tide, thereby associating the target with the target oligonucleotide. 

In case the target is a nucleic acid, it may be of the aptamer type, i.e. an ap- 
1 5 tamer or a library of aptamers comprising constant nucleic acid regions flank- 
ing a random oligonucleotide part. The random oligonucleotide part serves 
the function of a molecular target in the present invention and one or both the 
• constant region serves as target oligonucleotides. Alternatively, an additional 
oligonucleotide sequence can be adhered to one or both the constant regions 
20 to serve as the target oligonucleotide. The present invention also allows for 
the selection of pairs of aptamers that either binds to each other or binds to 
the same target but to different epitopes. The latter embodiment is of particu- 
lar relevance when evolving detection assays, where aptamers that bind to 
different epitopes on the same target may be desired. 

25 

In certain embodiments, a bifunctional complex having a display molecule 
binding to the molecular target constitutes the target oligonucleotide associ- 
ated with the molecular target. The bifunctional complexes binding to the tar- 
get and serving to associate an oligonucleotide to the molecular target may 
30 be provided prior to or during with the mixing step. In case the target oligonu- 
cleotide is associated with the molecular target during the mixing step the 
bifunctional complex may be a member of the library or may be a complex 
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added to the mixture. In some aspects of the invention, the display molecule 
is a compound known to bind to the target. The display molecule may be 
known to bind to the target from the prior art or from preceding screening 
procedures. To find a second or further binding compound, the target is gen- 
5 erally saturated with the known display molecule prior to the mixing step. The 
known binding molecule may or may not be attached to a nucleic acid se- 
quence. 

In case the target oligonucleotide emanates from an identifier oligonucleotide 
10 in the library of complexes, two Afunctional complexes of a library of Afunc- 
tional complexes are associated with a common molecular target and can be 
discovered simultaneously. The bifunctional complexes may bind to the same 
binding site of the molecular target or the bifunctional complexes may bind to 
different binding sites. In case the bifunctional complexes bind to the same 
15 binding sites, the display molecules may be connected to each other through 
a covalent or non-covalent chemical bond, e.g. by a technique known as click 
chemistry. By way of example only, the two display molecules can be con- 
nected through a disulphide bridge. In aspects of the invention in which the 
display molecules binds to discrete binding domains on the same target 
20 molecule they are usually adhered together after the identification process of 
the present invention by a suitable linker in order to form the effective com- 
pound. 

In a certain aspects of the invention an iterative method for finding the de- 
25 sired compound is applied. An initial display molecule or a pool of display 

molecules with affinity towards a target is found using the present method or 
another method. In the event the affinity is in the lower range of what is de- 
sired, the initial display molecule is amended by reaction with one or more 
chemical entities to form a second generation library, said second generation 
30 library being used in the method of the present invention. 
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In Nature, biochemical components interact. In a certain aspect of the pre- 
sent invention, two or more targets interacting in a biological context are 
separately subjected to the method of the invention, whereupon the identified 
display molecules binding to the two or more targets are linked via a suitable 
5 linker. By way of example only, blood factors, such as factor Vila and Xa may 
be prevented or promoted in the their interaction with each other using a 
compound having a part binding to a first blood factor and a part binding to a 
second blood factor. In other aspects of the invention, targets are linked by a 
suitable linker, as disclosed in Fig. 10. 

10 

A library of molecular targets may be generated by starting out from a library 
of DNA molecules, usually cDNA molecules, and preparing the correspond- 
ing RNA strands by a suitable RNA polymerase. In according with a certain 
aspect of the invention, the mixture step therefore includes that a molecular 
15 target library comprising different peptides each attached to the nucleic re- 
sponsible for the formation thereof is mixed with a library of complexes. 

The coupling of the target oligonucleotide and the identifier oligonucleotide is 
. promoted due to the relative high local concentration of the ends of oligonu- 
20 cleotides. The complexes in solution, i.e. complexes not bund to the target, 
are relatively remote from the target oligonucleotide and the tendency of the 
target oligonucleotides to be coupled to such unspecific complexes is re- 
duced. Among other things, the unspecific coupling depends on the concen- 
tration of the complexes in solution. 

25 

In an aspect of the invention the non-binding library members are either di- 
luted or at least partly removed prior to the coupling step. Dilution may be 
performed by adding a suitable buffer and the removal may be preformed by 
washing one or more times with a suitable liquid, such as a buffer. In the 
30 event the target molecule is immobilized on a bead, the beads may be re- 
tained on a filter while non-binding complexes is transported through the fil- 
ter. In case the target is in solution or made in solution by cleavage of a bond 
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immobilising the target, optional following one or more washing step, the re- 
moval can be performed by chromatography, such as size-exclusion chroma- 
tography. 

5 Thus, in a certain aspect of the invention, the mixing step includes the re- 
moval of non-binding library members prior to the coupling of the target oli- 
gonucleotide and the identifier nucleotide together- 
It may be an advantage to have all or at least a part of the nucleotides on a 
10 double stranded form during the contacting with the molecular target, as cer- 
tain nucleic acids may perform a binding interaction or a catalytical action on 
the components present during the mixing step. Thus, in one embodiment of 
the invention, the target oligonucleotide and/or the identifier oligonucleotide 
partly or fully is hybridised to a complementing oligonucleotide. 

15 

The coupling may be performed using any suitable means that ensures a 
physical connection. Suitably, the coupling is performed using means se- 
lected from the group consisting of chemical means, enzymatic means, and 
design means. 

20 

The chemical means for coupling the ends together can be selected from a 
large plethora. Suitable examples include that 

a) a first oligonucleotide end comprises a 3-OH group and the second oli- 
gonucleotide end comprises a 5'-phosphor-2-imidazole group. When reacted 

25 a phosphodiester intemucleoside linkage is formed, 

b) a first oligonucleotide end comprising a phosphoimidazolide group and the 
3-end and a phosphoimidazolide group at the 5'-and. When reacted together 
a phosphodiester intemucleoside linkage is formed, 

c) a first oligonucleotide end comprising a 3-phosphorothioate group and a 
30 second oligonucleotide comprising a 5-iodine. When the two groups are re- 
acted a 3'-0-P(=0)(OH)-S-5' intemucleoside linkage is formed, and 
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d) a first oligonucleotide end comprising a 3-phosphorothioate group and a 
second oligonucleotide comprising a 5-tosylate. When reacted a 3-0- 
P(=0)(OH)-S-5' intemucleoside linkage is formed. 

5 Suitably, the target oligonucleotide or a complementing target oligonucleotide 
and the indentifier oligonucleotide or a complementing identifier oligonucleo- 
tide operatively are joined together, so that as to allow a nucleic acid active 
enzyme to recognize the coupling area a substrate. Notably, in a preferred 
embodiment, the coupling is performed so as to allow a polymerase to rec- 
ognise the coupled strand as a template. Thus, in a preferred aspect, a 
chemical reaction strategy for the coupling step generally includes the forma- 
tion of a phosphodiester intemucleoside linkage. In accordance with this as- 
pect, method a) and b) above is preferred. 

The enzymatic means is in some instances preferred because the coupling 
reaction is specific, i.e. the risk of side reactions in virtually not present. The 
enzymatic means is in general selected from the enzymes of the polymerase 
type, ligase type, and restriction enzymes, as well as any combination 
thereof. 



Ligases are useful means for the coupling step. Suitable examples include 
Taq DNA ligase, T4 DNA ligase, T7 DNA ligase, and E. coli DNA ligase. The 
choice of the ligase depends to a certain degree on the design of the ends to 
be joined together. Thus, if the ends are blunt, T4 DNA ligase may be pre- 
ferred, while a Taq DNA ligase may be preferred for a sticky end ligation, i.e. 
a ligation in which an overhang on each end is a complement to each other. 

In a certain aspect of the invention, a connector oligonucleotide is used. The 
connector oligonucleotide has a region complementing a distal part of the 
target oligonucleotide and a region complementing a distal part of the identi- 
fier oligonucleotide. If the ends of the target and identifier oligonucleotides 
abut each other a ligase can ligate the ends together. If a gap exists between 



WO 2005/026387 PCT/DK2004/000630 

72 

the ends, a polymerase may be used to fill the gap and a ligase may subse- 
quently perform a ligation. The regions of the connector oligonucleotide com- 
plementing the identifier oligonucleotide and target oligonucleotide, respec- 
tively, may independently be chose, e.g. in the range of 6 to 16 nucleotides, 
5 preferably in the range of 8 to 12 nucleotides. In a particular aspect of the 

invention the connector oligonucleotide is added in excess relative to the total 
target and identifier oligonucleotides to saturate the ends of complexes not 
bound to a target. 

10 In another aspect of the invention the coupling is performed by design 

means. As an example, the regions at the distal ends of the target and identi- 
fier oligonucleotides are designed to be complementary to each other. Under 
hybridisation conditions polymerase is then allowed to extend the target oli- 
gonucleotide as well as the identifier oligonucleotide to obtain a double 

15 stranded coupled product. 

Still another coupling means include the design of the ends of one or more 
oligonucleotides with sticky ends. In a certain aspect, the target oligonucleo- 
tide and/or the identifier oligonucleotide is provided with a sticky end to allow 

20 a ligase or a polymerase or a mixture thereof to adjoin the oligonucleotides. 
Suitably, the sticky ends can be formed by a restriction nuclease. In a practi- 
cal approach, the target and the identifier oligonucleotides are initially double 
stranded and provide at the ends with a restriction site. Following the initial 
contact, the mixture is treated with a restriction nuclease to form the sticky 

25 ends. After subsequent removal of the restriction enzyme from the mixture, a 
ligase is allowed to perform its enzymatic action to form a ligation product. 

The coupled product of the identifier and target oligonucleotides comprises 
the information necessary for decoding the identity of the display molecule 
30 and optionally also of the molecular target. The coupled product may be ana- 
lysed directly in some instances to reveal the identity of the display molecules 
that have performed an interaction with the molecular target. As an example, 
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the coupled product may be detached from the target-display molecule inter- 
action using a cut by a restriction nuclease at positions of the coupled prod- 
uct flanking the informative part. The informative part can be decoded in a 
standard sequencing machine. In general however, it is preferred to include 
5 the informative part of the coupled product in to a suitable vector and transfer 
the vector to a host organism. The host organisms may then be cultivated on 
a suitable substrate and allowed to form colonies. Samples from the colonies 
may be used for sequencing in a sequencing machine. 

10 In another approach, the target and/or the identifier oligonucleotides or se- 
quences complementary thereto at the proximal end is provided with a prim- 
ing site. The priming site may be used for annealing a primer to allow a po- 
lymerase to extent the primer using the coupled product as template. Appro- 
priately, the extended strand is designed with another priming site which may 

15 allow a second (reverse) primer to anneal thereto and subsequently a poly- 
merase to perform an extension of the primer to produce a sequence identi- 
cal to the coupled product. The extension product or the amplification product 
may be analysed directly or be incorporated into a vector which subsequently 
is transformed into a host organism as explained above. In a certain aspect 

20 of the invention, the coupled oligonucleotide is amplified by PCR using prim- 
ing sites positioned proximal to the display molecule and the molecular tar- 
get, respectively, and flanking the informative part of the coupled product. 



Following the coupling step, the target-display molecule conjugate may be 
recovered. Any method that result in a recovery may in principle be used, 
including filtering, washing, elution, chromatography, etc. In a preferred as- 
pect the target-display molecule conjugate is recovered by chromatography 
following the coupling of the target and the identifier oligonucleotides. Op- 
tionally, the various recovering methods may be combined. As an example, a 
washing or elution step may precede the chromatography step. A presently 
preferred chromatography method is size-exclusion chromatography. The 
chromatography is usually performed on a sample in which the target-display 
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molecule conjugate is in solution. In one aspect of the invention, the target 
has been cleaved from a solid support prior to the chromatography step. 

In an aspect of the invention, selective cleavable chemical moieties at each 
5 end of the coupled oligonucleotides are cleaved to liberate the coupled 

oligonucleotides prior to amplification. Usually, the cleavage is preceded of a 
step removing the non-binding complexes. The liberated product may be re- 
covered as described above and subjected to amplification. 

10 The invention also pertains to a conjugate comprising a molecular target as- 
sociated with an oligonucleotide and a Afunctional complex comprising a 
display molecule attached to an identifier oligonucleotide, which codes for 
said molecule. Usually, the display molecule part of the complex is bound to 
the target. The target oligonucleotide and/or the identifier oligonucleotide of 

15 the conjugate are in a certain embodiment joined to the molecular target 
and/or the display molecule, respectively, through a selectively cleavable 
linkage. In an embodiment of the invention, the target oligonucleotide and the 
identifier oligonucleotide are coupled together, whereby a product is obtained 
wherein a display molecule is bound to the molecular target and the identifier 

20 and target oligonucleotides are coupled together. In a certain aspect, the 
coupled oligonucleotide is amplifiable. The amplifiability is usually obtained 
through a coupling that involves a phospodiester internucleoside linkage. 

The present invention also extends to a display molecule identified by any of 
25 the methods disclosed herein. 



Detailed Description 

Complex 

The complex used in the present invention comprises a display molecule and 
30 an identifier oligonucleotide. The identifier oligonucleotide comprises identify- 
ing moieties that identifies the display molecule. Preferably, the identifier oli- 
gonucleotide identifies the molecule uniquely, i.e. in a library of complexes a 
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particular identifier oligonucleotide is capable of distinguishing the molecule it 
is attached to from the rest of the display molecules. 

The display molecule and the identifier oligonucleotide may be attached di- 
5 rectly to each other or through a bridging moiety. In one aspect of the inven- 
tion, the bridging moiety is a selectively cleavable linkage. 

The method may in certain embodiments be performed without amplification 
after the coupling step. However, when larger libraries are used and the 
10 amount of separated coupled product oligonucleotide is relatively low, it is in 
general preferred to use an identifier oligonucleotide and a coupled prodct 
which is amplifiable. Identifier oligonucleotides comprising a sequence of nu- 
cleotides may be amplified using standard techniques, like PCR. 

1 5 The identifier oligonucleotide may comprise two or more codons. The se- 
quence of codons can be decoded to identify reactants used in the formation 
of the molecule. When the identifier oligonucleotide comprises more than one 
codon, each member of a pool of chemical entities can be identified and the 
order of codons is informative of the synthesis step each member has been 

20 incorporated in. 



The sequence of the nucleotides in each codon may have any suitable 
length. The codon may be a single nucleotide or a plurality of nucleotides. In 
some aspects of the invention, it is preferred that each codon independently 
comprises four or more nucleotides, more preferred 4 to 30 nucleotides. 

The identifier oligonucleotide will in general have at least two codons ar- 
ranged in sequence, i.e. next to each other. Two neighbouring codons may 
be separated by a framing sequence. Depending on the encoded molecule 
formed, the identifier oligonucleotide may comprise further codons, such as 
3, 4, 5, or more codons. Each of the further codons may be separated by a 
suitable framing sequence. Preferably, all or at least a majority of the codons 
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of the identifier oligonucleotide are separated from a neighbouring codon by 
a framing sequence. The framing sequence may have any suitable number 
of nucleotides, e.g. 1 to 20. Alternatively, coddns on the identifier oligonu- 
cleotide may be designed with overlapping sequences. 

5 

The framing sequence, if present, may serve various purposes. In one setup 
of the invention, the framing sequence identifies the position of the codon. 
Usually, the framing sequence either upstream or downstream of a codon 
comprises information which allows determination of the position of the 

10 codons. In another setup, the frames have alternating sequences, allowing 
for addition of building blocks from two pools in the formation of the library. 
The framing sequence may also or in addition provide for a region of high 
affinity. The high affinity region may ensure that the hybridisation of the tem- 
plate with an anti-codon will occur in frame. Moreover, the framing sequence 

15 may adjust the annealing temperature to a desired level. 



A framing sequence with high affinity can be provided by incorporation of one 
or more nucleobases forming three hydrogen bonds to a cognate nucleo- 
base. Examples of nucleobases having this property are guanine and cyto- 
sine. Alternatively, or in addition, the framing sequence may be subjected to 
backbone modification. Several back bone modifications provides for higher 
affinity, such as 2-O-methyl substitution of the ribose moiety, peptide nucleic 
acids (PNA), and 2*-4' O-methylene cyclisation of the ribose moiety, also re- 
ferred to as LNA (Locked Nucleic Acid). 

The identifier oligonucleotide may comprise one or two flanking regions. The 
flanking region can encompass a signal group, such as a flourophor or a ra- 
dio active group to allow for detection of the presence or absence of a com- 
plex or the flanking region may comprise a label that may be detected, such 
as biotin. When the identifier oligonucleotide comprises a biotin moiety, the 
identifier oligonucleotide may easily be recovered. 
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The flanking region(s) can also serve as priming sites for amplification reac- 
tions, such as PCR. The identifier oligonucleotide may in certain embodi- 
ments comprise an affinity region having the property of being able to hybrid- 
ise to a building block. 

5 

It is to be understood that when the term identifier oligonucleotide is used in 
the present description and claims, the identifier oligonucleotide may be in 
the sense or the anti-sense format, i.e. the identifier oligonucleotide can be a 
sequence of codons which actually codes for the molecule or can be a se- 
10 quence complementary thereto. Moreover, the identifier oligonucleotide may 
be single-stranded or double-stranded, as appropriate. 

The display molecule part of the complex is generally of a structure expected 
of having an effect on the target. When the target is of pharmaceutical impor- 

15 tance, the molecule is generally a possible drug candidate. The complex may 
be formed by tagging a library of different possible drug candidates with a 
tag, e.g. a nucleic acid tag identifying each possible drug candidate. In an- 
other embodiment of the invention, the molecule is encoded, i.e. formed by a 
variety of reactants which have reacted with each other and/or a scaffold 

20 molecule. Optionally, this reaction product may be post-modified to obtain the 
final molecule displayed on the complex. The post-modification may involve 
the cleavage of one or more chemical bonds attaching the encoded molecule 
to the indentifier in order more efficiently to display the encoded molecule. In 
still another embodiment the display molecule is a polypeptide formed using 

25 the natural machinery, such as the methods disclosed in WO 92/02536, WO 
91/05058, and US 6,194,550. 

The formation of a synthetic encoded molecule generally starts by a scaffold, 
i.e. a chemical unit having one or more reactive groups capable of forming a 
30 connection to another reactive group positioned on a chemical entity, thereby 
generating an addition to the original scaffold. A second chemical entity may 
react with a reactive group also appearing on the original scaffold or a 
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reactive group incorporated by the first chemical entity. Further chemical 
entities may be involved in the formation of the final reaction product. The 
formation of a connection between the chemical entity and the nascent 
encoded molecule may be mediated by a bridging molecule. As an example, 
5 if the nascent encoded molecule and the chemical entity both comprise an 
amine group a connection between these can be mediated by a dicarboxylic 
acid. A display molecule is in general produced in vitro and may be a 
naturally occurring or an artificial substance. In an aspect of the invention, a 
display molecule is not produced using the naturally translation system in an 
10 in vitro process. In other aspects of the invention, the display molecule is a 
polypeptide produced using the natural translation machinery. 

The chemical entities that are precursors for structural additions or 
eliminations of the encoded molecule may be attached to a building block 
15 prior to the participation in the formation of the reaction product leading to the 
final display molecule. Besides the chemical entity, the building block 
generally comprises an anti-codon. In some embodiments the building blocks 
also comprise an affinity region providing for affinity towards the nascent 
complex. 

20 

Thus, the chemical entities are suitably mediated to the nascent encoded 
molecule by a building block, which further comprises an anticodon. The anti- 
codon serves the function of transferring the genetic information of the 
building block in conjunction with the transfer of a chemical entity. The 

25 transfer of genetic information and chemical entity may occur in any order, 
however, it is important that a correspondence is maintained in the complex. 
The chemical entities are preferably reacted without enzymatic interaction in 
some aspects of the invention. Notably, the reaction of the chemical entities 
is preferably not mediated by ribosomes or enzymes having similar activity. 

30 In another aspect of the invention a ribosome is used to translate an mRNA 
into a protein using a tRNA loaded with a natural or unnatural amino acid. In 
still another aspect of the invention, enzymes having catalytic activities 
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different from that of ribosomes are used in the formation of the display 
molecule. 

According to certain aspects of the invention the genetic information of the 
5 anti-codon is transferred by specific hybridisation to a codon on a nucleic 

acid template. Another method for transferring the genetic information of the 
anti-codon to the nascent complex is to anneal an oligonucleotide 
complementary to the anti-codon and attach this oligonucleotide to the 
complex, e.g. by ligation. A still further method involves transferring the 
10 genetic information of the anti-codon to the nascent complex by an extension 
reaction using a polymerase and a mixture of dNTPs. 



The chemical entity of the building block may in certain cases be regarded as 
a precursor for the structural entity eventually incorporated into the encoded 

15 molecule. In other cases the chemical entity provides for the eliminations of 
chemical units of the nascent encoded molecule. Therefore, when it in the 
present application with claims is stated that a chemical entity is reacted with 
a nascent encoded molecule it is to be understood that not necessarily all the 
atoms of the original chemical entity is to be found in the eventually formed 

20 encoded molecule. Also, as a consequence of the reactions involved in the 
connection, the structure of the chemical entity can be changed when it ap- 
pears on the nascent encoded molecule. Especially, the cleavage resulting in 
the release of the entity may generate a reactive group which in a subse- 
quent step can participate in the formation of a connection between a nas- 

25 cent complex and a chemical entity. 



The chemical entity of the building block comprises at least one reactive 
group capable of participating in a reaction which results in a connection be- 
tween the chemical entity of the building block and another chemical entity or 
30 a scaffold associated with the nascent complex. The number of reactive 

groups which appears on the chemical entity is suitably one to ten. A building 
block featuring only one reactive group is used La. in the end positions of 
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polymers or scaffolds, whereas building blocks having two reactive groups 
are suitable for the formation of the body part of a polymer or scaffolds capa- 
ble of being reacted further. One, two or more reactive groups intended for 
the formation of connections, are typically present on scaffolds. Non-limiting 
5 examples of scaffolds are opiates, steroids, benzodiazepines, hydantoines, 
and peptidylphosphonates. 

The reactive group of the chemical entity may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
10 of the building block may be capable of forming a connection to a reactive 
group of the nascent complex through a bridging fill-in group. It is to be un- 
derstood that not all the atoms of a reactive group are necessarily maintained 
in the connection formed. Rather, the reactive groups are to be regarded as 
precursors for the structure of the connection. 

15 

The subsequent cleavage step to release the chemical entity from the build- 
ing block can be performed in any appropriate way. In an aspect of the inven- 
tion the cleavage involves usage of a chemical reagent or an enzyme. The 
cleavage results in a transfer of the chemical entity to the nascent encoded 

20 molecule or in a transfer of the nascent encoded molecule to the chemical 
entity of the building block. In some cases it may be advantageous to intro- 
duce new chemical groups as a consequence of linker cleavage. The new 
chemical groups may be used for further reaction in a subsequent cycle, ei- 
ther directly or after having been activated. In other cases it is desirable that 

25 no trace of the linker remains after the cleavage. 

In another aspect, the connection and the cleavage is conducted as a simul- 
taneous reaction, i.e. either the chemical entity of the building block or the 
nascent encoded molecule is a leaving group of the reaction. In some as- 
30 pects of the invention, it is appropriate to design the system such that the 
connection and the cleavage occur simultaneously because this will reduce 
the number of steps and the complexity. The simultaneous connection and 
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cleavage can also be designed such that either no trace of the linker remains 
or such that a new chemical group for further reaction is introduced, as de- 
scribed above. 

5 The attachment of the chemical entity to the building block, optionally via a 
suitable spacer can be at any entity available for attachment, e.g. the chemi- 
cal entity can be attached to a nucleobase or the backbone. In general, it is 
preferred to attach the chemical entity at the phosphor of the internucleoside 
linkage or at the nucleobase. When the nucleobase is used for attachment of 

10 the chemical entity, the attachment point is usually at the 7 position of the 

purines or 7-deaza-purins or at the 5 position of pyrimidines. The nucleotide 
may be distanced from the reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the conformational space 
sampled by the reactive group is optimized for a reaction with the reactive 

15 group of the nascent encoded molecule. 

The display molecules of the invention may have any chemical structure. In a 
preferred aspect, the display molecule can be any compound that may be 
synthesized in a component-by-component fashion. In some aspects the 
20 display molecule is a linear or branched polymer. In another aspect the 

display molecule is a scaffolded molecule. The term "display molecule" also 
comprises naturally occurring molecules like a-polypeptides etc, however 
produced in vitro usually in the absence of enzymes, like ribosomes. In 
certain aspects, the display molecule of the library is a non-a-polypeptide. 

25 

The display molecule may have any molecular weight. However, in order to 
be orally available, it is in this case preferred that the display molecule has a 
molecular weight less than 2000 Daltons, preferably less than 1000 Dalton, 
and more preferred less than 500 Daltons. 

30 

The size of the library may vary considerably pending on the expected result 
of the inventive method. In some aspects, it may be sufficient that the library 



WO 2005/026387 PCT/DK2004/000630 

82 

comprises two, three, or four different complexes. However, in most events, 
more than two different complexes are desired to obtain a higher diversity. In 
some aspects, the library comprises 1 ,000 or more different complexes, more 
preferred 1 ,000,000 or more different complexes. The upper limit for the size 
5 of the library is only restricted by the size of the vessel in which the library is 
comprised. It may be calculated that a vial may comprise up to 10 14 different 
complexes. 

Methods for forming libraries of complexes 

10 The complexes comprising an identifier oligonucleotide having two or more 
codons that codes for reactants that have reacted in the formation of the 
molecule part of the complex may be formed by a variety of processes. Gen- 
erally, the preferred methods can be used for the formation of virtually any 
kind of encode molecule. Suitable examples of processes include prior art 

15 methods disclosed in WO 93/20242, WO 93/06121 , WO 00/23458, WO 
02/074929, and WO 02/103008, the content of which being incorporated 
herein by reference as well as methods of the present applicant not yet public 
available, including the methods disclosed in DK PA 2002 01955 filed 19 De- 
cember 2002, and DK PA 2003 00430 filed 20 March 2003. Any of these 

20 methods may be used, and the entire content of the patent applications are 
included herein by reference. 

Below four preferred embodiments are described. A first embodiment 
disclosed in more detail in WO 02/103008 is based on the use of a 

25 polymerase to incorporate unnatural nucleotides as building blocks. Initially, a 
plurality of template oligonucleotides is provided. Subsequently primers are 
annealed to each of the templates and a polymerase is extending the primer 
using nucleotide derivatives which have appended chemical entities. 
Subsequent to or simultaneously with the incorporation of the nucleotide 

30 derivatives, the chemical entities are reacted to form a reaction product. The 
encoded molecule may be post-modified by cleaving some of the linking 
moieties to better present the encoded molecule. 
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Several possible reaction approaches for the chemical entities are apparent. 
First, the nucleotide derivatives can be incorporated and the chemical entities 
subsequently polymerised. In the event the chemical entities each carry two 
5 reactive groups, the chemical entities can be attached to adjacent chemical 
entities by a reaction of these reactive groups. Exemplary of the reactive 
groups are amine and carboxylic acid, which upon reaction form an amide 
bond. Adjacent chemical entities can also be linked together using a linking 
or bridging moiety. Exemplary of this approach is the linking of two chemical 

10 entities each bearing an amine group by a bi-carboxylic acid. Yet another 
approach is the use of a reactive group between a chemical entity and the 
nucleotide building block, such as an ester or a thioester group. An adjacent 
building block having a reactive group such as an amine may cleave the 
interspaced reactive group to obtain a linkage to the chemical entity, e.g. by 

15 an amide linking group. 



A second embodiment for obtainment of complexes pertains to the use of 
hybridisation of building blocks to a template and reaction of chemical entities 
attached to the building blocks in order to obtain a reaction product. This 
approach comprises that templates are contacted with a plurality of building 
blocks, wherein each building block comprises an anti-codon and a chemical 
entity. The anti-codons are designed such that they recognise a sequence, 
i.e. a codon, on the template. Subsequent to the annealing of the anti-codon 
and the codon to each other a reaction of the chemical entity is effected. 

The template may be associated with a scaffold. Building blocks bringing 
chemical entities in may be added sequentially or simultaneously and a 
reaction of the reactive group of the chemical entity may be effected at any 
time after the annealing of the building blocks to the template. 

A third embodiment for the generation of a complex includes chemical or 
enzymatical ligation of building blocks when these are lined up on a template. 
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Initially, templates are provided, each having one or more codons. The 
templates are contacted with building blocks comprising anti-codons linked to 
chemical entities. The two or more anti-codons annealed on a template are 
subsequently ligated to each other and a reaction of the chemical entities is 
5 effected to obtain a reaction product. The method is disclosed in more detail 
in DK PA 2003 00430 filed 20 March 2003. 

A fourth embodiment makes use of the extension by a polymerase of an 
affinity sequence of the nascent complex to transfer the anti-codon of a 

10 building block to the nascent complex. The method implies that a nascent 

complex comprising a scaffold and an affinity region is annealed to a building 
block comprising a region complementary to the affinity section. 
Subsequently the anti-codon region of the building block is transferred to the 
nascent complex by a polymerase. The transfer of the chemical entity may 

15 be transferred prior to, simultaneously with or subsequent to the transfer of 
the anti-codon. This method is disclosed in detail in DK PA 2002 01955 filed 
19 December 2002 and DK PA 2003 01064, filed 1 1 July 2003. 

Thus, the codons are either pre-made into one or more templates before the 
20 encoded molecules are generated or the codons are transferred 
simultaneously with the formation of the encoded molecules. 

After or simultaneously with the formation of the reaction product some of the 
linkers to the template may be cleaved, however at least one linker must be 
25 maintained to provide for the complex. 

Nucleotides 

The nucleotides used in the present invention may be linked together in a 
sequence of nucleotides, i.e. an oligonucleotide. Each nucleotide monomer is 
30 normally composed of two parts, namely a nucleobase moiety, and a back- 
bone. The back bone may in some cases be subdivided into a sugar moiety 
and an intemucleoside linker. 
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The nucleobase moiety may be selected among naturally occurring nucleo- 
bases as well as non-naturally occurring nucleobases. Thus, "nucleobase" 
includes not only the known purine and pyrimidine hetero-cycles, but also 
5 heterocyclic analogues and tautomers thereof. Illustrative examples of nu- 
cleobases are adenine, guanine, thymine, cytosine, uracil, purine, xanthine, 
diaminopurine, 8-oxo-N 6 -methyladenine, 7-deazaxanthine, 7-deazaguanine, 
N 4 ,N 4 -ethanocytosin, N 6 ,N 6 -ethano-2,6-diamino-purine, 5-methylcytosine, 5- 
(C 3 -C 6 )-alkynylcytosine, 5-fluorouracil, 5-bromouracil, pseudoisocytosine, 2- 

10 hydroxy-5-methyl-4-triazolopyridine, isocytosine, isoguanine, inosine and the 
"non-naturally occurring" nucleobases described in Benner et al., U.S. Pat 
No. 5,432,272. The term "nucleobase" is intended to cover these examples 
as well as analogues and tautomers thereof. Especially interesting nucleo- 
bases are adenine, guanine, thymine, cytosine, 5-methyicytosine, and uracil, 

15 which are considered as the naturally occurring nucleobases in relation to 
therapeutic and diagnostic application in humans. 

Examples of suitable specific pairs of nucleobases are shown on p27 of this 
application, entitled "Natural base pairs", "synthetic base pairs", "synthetic 
20 purine bases pairing with natural pyrimidines". 

Suitable examples of backbone units are shown diagrammatically on p28 of 
this application (B denotes a nucleobase). The sugar moiety of the backbone 
is suitably a pentose but may be the appropriate part of an PNA or a six- 
25 member ring. Suitable examples of possible pentoses include ribose, 2'- 
deoxyribose, 2'-0-methyl-ribose, 2'-flour-ribose, and 2'-4'-0-methylene- 
ribose (LNA). Suitably the nucleobase is attached to the V position of the 
pentose entity. 

30 An intemucleoside linker connects the 3' end of preceding monomer to a 5' 
end of a succeeding monomer when the sugar moiety of the backbone is a 
pentose, like ribose or 2-deoxyribose. The intemucleoside linkage may be 
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the natural occurring phospodiester linkage or a derivative thereof. Exam- 
ples of such derivatives include phosphorothioate, methylphosphonate, 
phosphoramidate, phosphotriester, and phosphodithioate. Furthermore, the 
internucleoside linker can be any of a number of non-phosphorous-containing 
5 linkers known in the art. 



Preferred nucleic acid monomers include naturally occurring nucleosides 
forming part of the DNA as well as the RNA family connected through phos- 
phodiester linkages. The members of the DNA family include de- 

10 oxyadenosine, deoxyguanosine, deoxythymidine, and deoxycytidine. The 

members of the RNA family include adenosine, guanosine, uridine, cytidine, 
and inosine. Inosine is a non-specific pairing nucleoside and may be used as 
universal base because inosine can pair nearly isoenergetically with A, T, 
and C. Other compounds having the same ability of non-specifically base- 

15 pairing with natural nucleobases have been formed. Suitable compounds 
which may be utilized in the present invention includes among others the 
compounds depicted on p30 of this application, entitled "Examples of Univer- 
sal Bases". 

20 Building block 

The chemical entities that are precursors for structural additions or elimina- 
tions of the encoded molecule may be attached to a building block prior to 
the participation in the formation of the reaction product leading the final en- 
coded molecule. Besides the chemical entity, the building block generally 

25 comprises an anti-codon. 

The chemical entity of the building block comprises at least one reactive 
group capable of participating in a reaction which results in a connection be- 
tween the chemical entity of the building block and another chemical entity or 
30 a scaffold associated with the nascent complex. The connection is facilitated 
by one or more reactive groups of the chemical entity. The number of reac- 
tive groups which appear on the chemical entity is suitably one to ten. A 
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building block featuring only one reactive group is used i.a. in the end posi- 
tions of polymers or scaffolds, whereas building blocks having two reactive 
groups are suitable for the formation of the body part of a polymer or scaf- 
folds capable of being reacted further. One, two or more reactive groups in- 
5 tended for the formation of connections, are typically present on scaffolds. 

The reactive group of the building block may be capable of forming a direct 
connection to a reactive group of the nascent complex or the reactive group 
of the building block may be capable of forming a connection to a reactive 
10 group of the nascent complex through a bridging fill-in group. It is to be un- 
derstood that not all the atoms of a reactive group are necessarily maintained 
in the connection formed. Rather, the reactive groups are to be regarded as 
precursors for the structure of the connection. 



1 5 The subsequent cleavage step to release the chemical entity from the build- 
ing block can be performed in any appropriate way. In an aspect of the inven- 
tion the cleavage involves usage of a reagent or an enzyme. The cleavage 
results in a transfer of the chemical entity to the nascent encoded molecule 
or in a transfer of the nascent encoded molecule to the chemical entity of the 

20 building block. In some cases it may be advantageous to introduce new 
chemical groups as a consequence of linker cleavage. The new chemical 
groups may be used for further reaction in a subsequent cycle, either directly 
or after having been activated. In other cases it is desirable that no trace of 
the linker remains after the cleavage. 

25 

In another aspect, the connection and the cleavage is conducted as a simul- 
taneous reaction, i.e. either the chemical entity of the building block or the 
nascent encoded molecule is a leaving group of the reaction. In general, it is 
preferred to design the system such that the connection and the cleavage 
30 occur simultaneously because this will reduce the number of steps and the 

complexity. The simultaneous connection and cleavage can also be designed 
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such that either no trace of the linker remains or such that a new chemical 
group for further reaction is introduced, as described above. 

The attachment of the chemical entity to the building block, optionally via a 
5 suitable spacer can be at any entity available for attachment, e.g. the chemi- 
cal entity can be attached to a nucleobase or the backbone. In general, it is 
preferred to attach the chemical entity at the phosphor of the internucleoside 
linkage or at the nucleobase. When the nucleobase is used for attachment of 
the chemical entity, the attachment point is usually at the 7 position of the 
1 0 purines or 7-deaza-purins or at the 5 position of pyrimidines. The nucleotide 
may be distanced from the reactive group of the chemical entity by a spacer 
moiety. The spacer may be designed such that the conformational space 
sampled by the reactive group is optimized for a reaction with the reactive 
group of the nascent encoded molecule or reactive site. 



15 



20 



The anticodon complements the codon of the identifier oligonucleotide se- 
quence and generally comprises the same number of nucleotides as the 
codon. The anticodon may be adjoined with a fixed sequence, such as a se- 
quence complementing a framing sequence. 

Various specific building blocks are envisaged. Building blocks of particular 
interest are shown below. 



Building blocks transferring a chemical entity to a recipient nucleophilic group 
25 The building block indicated below is capable of transferring a chemical entity 
(CE) to a recipient nucleophilic group, typically an amine group. The bold 
lower horizontal line illustrates the building block and the vertical line illus- 
trates a spacer. The 5-membered substituted N-hydroxysuccinimid (NHS) 
ring serves as an activator, i.e. a labile bond is formed between the oxygen 
30 atom connected to the NHS ring and the chemical entity. The labile bond 
may be cleaved by a nucleophilic group, e.g. positioned on a scaffold 
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The 5-membered substituted N-hydroxysuccinimid (NHS) ring serves as an 
activator, i.e. a labile bond is formed between the oxygen atom connected to 
5 the NHS ring and the chemical entity. The labile bond may be cleaved by a 
nucleophilic group, e.g. positioned on a scaffold, to transfer the chemical en- 
tity to the scaffold, thus converting the remainder of the fragment into a leav- 
ing group of the reaction. When the chemical entity is connected to the acti- 
vator through an carbonyl group and the recipient group is an amine, the 
10 bond formed on the scaffold will an amide bond. The above building block is 
the subject of WO03078627A2, the content of which is incorporated herein in 
their entirety by reference. 

Another building block which may form an amide bond is 

15 




CE* 



R may be absent or N0 2> CF 3 , halogen, preferably CI, Br, or I, and Z may be 
S or O. This type of building block is disclosed in WO03078626A2. The con- 
tent of this patent application is incorporated herein in the entirety by refer- 
20 ence. 
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A nucleophilic group can cleave the linkage between Z and the carbonyl 
group' thereby transferring the chemical entity -(C==0)-CE' to said nucleo- 
philic group. 

5 

Building blocks transferring a chemical entity to a recipient reactive group 
forming a C=C bond 

A building block as shown below are able to transfer the chemical entity to a 
recipient aldehylde group thereby forming a double bond between the carbon 
10 of the aldehyde and the chemical entity 




15 The above building block is disclosed in WO03078445A2, the content of 
which being incorporated herein in the entirety by reference. 

Building blocks transferring a chemical entity to a recipient reactive group 
forming a C-C bond 

20 The below building block is able to transfer the chemical entity to a recipient 
group thereby forming a single bond between the receiving moiety, e.g. a 
scaffold, and the chemical entity. 
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B 



CE 



10 



15 



The above building block is disclosed in WO03078445A2, the content of 
which being incorporated herein in the entirety by reference. 

Another building block capable of transferring a chemical entity to a receiving 
reactive group forming a single bond is 



The receiving group may be a nucleophile, such as a group comprising a 
hetero atom, thereby forming a single bond between the chemical entity and 
the hetero atom, or the receiving group may be an electronegative carbon 
atom, thereby forming a C-C bond between the chemical entity and the scaf- 
fold. The above building block is disclose in WO03078446A2, the content of 
which is incorporated herein by reference. 

The chemical entity attached to any of the above building blocks may be a 
selected from a large arsenal of chemical structures. Examples of chemical 
entities are 

H or entities selected among the group consisting of a Ci-Ce alkyl, C2-C6 al- 
kenyl, Cz-Ce alkynyl, C 4 -C 8 alkadienyl, C3-C7 cycloalkyl, C3-C7 cycloheteroal- 




o 



CE 



O 
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kyl, aryl, and heteroaryl, said group being substituted with 0-3 R 4 , 0-3 R 5 and 
0-3 R 9 or C r C 3 alkylene-NR 4 2 , d-Cs alkylene-NR 4 C(0)R 8 , d-C 3 al- 
kylene-NR 4 C(0)OR 8 , d-C 2 alkylene-0-NR 4 2 , d-C 2 alkylene-0-NR 4 C(0)R 8 , 
C r C 2 alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 . 
5 where R 4 is H or selected independently among the group consisting 

of d-C 6 alkyl, C2-C 6 alkenyl, C 2 -C 6 alkynyl, C3-C7 cycloalkyl, C3-C7 cyclohet- 
eroalkyl, aryl, heteroaryl, said group being substituted with 0-3 R 9 and 

R 5 is selected independently from -N 3 , -CNO, -C(NOH)NH 2 , -NHOH, 
-NHNHR 6 , -C(0)R 6 , -SnR 6 3 , -B(OR 6 ) 2 , -P(0)(OR 6 ) 2 or the group consisting of 
10 C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 4 -C 8 alkadienyl said group being substituted 
with 0-2 R 7 , 

where R 6 is selected independently from H, d-C 6 alkyl, C3.C7 cycloal- 
kyl, aryl or Ci-C 6 alkylene-aryl substituted with 0-5 halogen atoms selected 
from -F, -CI, -Br, and -I; and 

15 R 7 is independently selected from -N0 2 , -COOR 6 , -COR 6 , -CN, 

-OSiR 6 3 , -OR 6 and -NR 6 2 . 

R 8 is H, Ci-C 6 alkyl, C 2 -C 6 alkenyl, d-d alkynyl, C 3 -C 7 cycloalkyl, 
aryl or d-C 6 alkylene-aryl substituted with 0-3 substituents independently 
selected from -F, rCI, -N0 2 , -R 3 , -OR 3 , -SiR 3 3 

20 R 9 is =0, -F, -CI, .Br, -I, -CN, -N0 2 , -OR 6 , -NR 6 2 , -NR 6 -C(0)R 8 , 

-NR 6 -C(0)OR 8 , -SR 6 , -S(0)R 6 , -S(0) 2 R 6 , -COOR 6 , -C(0)NR 6 2 and 
-S(0) 2 NR 6 2 . 



Cross-fink cleavage building blocks 
25 It may be advantageous to split the transfer of a chemical entity to a recipient 
reactive group into two separate steps, namely a cross-linking step and a 
cleavage step because each step can be optimized. A suitable building block 
for this two step process is illustrated below: 
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A 



Initally, a reactive group appearing on the functional entity precursor (abbre- 
viated FEP) reacts with a recipient reactive group, e.g. a reactive group ap- 
5 pearing on a scaffold, thereby forming a cross-link. Subsequently, a cleavage 
is performed, usually by adding an aqueous oxidising agent such as l 2 , Br 2 , 
Cl 2 , H + , or a Lewis acid. The cleavage results in a transfer of the group HZ- 
FEP- to the recipient moiety, such as a scaffold. 

10 In the above formula 
Z is O, S, NR 4 
Q is N, CR 1 

P is a valence bond, O, S, NR 4 , or a group Cs-yarylene, C^alkylene, 
C^O-alkylene, Ci-eS-alkylene, NR 1 -alkylene, d^alkylene-O, C^alkylene-S 
15 option said group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or d-C 3 
alkylene-NR 4 2 , C^-C 3 alkylene-NR 4 C(0)R 8 , C-,-C 3 alkylene-NR 4 C(0)OR 8 , 
d-Cz alkylene-0-NR 4 2 , Q{-C 2 alkylene-0-NR 4 C(0)R 8 , C1-C2 al- 
kylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 
B is a group comprising D-E-F, in which 
20 D is a valence bond or a group C-i-ealkylene, C^alkenylene, Ci„ 

6 alkynylene, Cs-yarylene, or Cs-yheteroarylene, said group optionally being 
substituted with 1 to 4 group R 11 , 

E is, when present, a valence bond, O, S, NR 4 , or a group Ci. 
6 alkylene, C^alkenylene, Ci^alkynylene, C5-7arylene, or Cs-yheteroarylene, 
25 said group optionally being substituted with 1 to 4 group R 11 , 
F is, when present, a valence bond, O, S, or NR 4 , 
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A is a spacing group distancing the chemical structure from the com- 
plementing element, which may be a nucleic acid, 

R 1 , R 2 , and R 3 are independent of each other selected among the 
group consisting of H, C1-C6 alkyl, C2-C 6 alkenyl, C2-C 6 alkynyl, C 4 -Cs alkadi- 
5 enyl, C3-C7 cycloalkyl, C3-C7 cycloheteroalkyl, aryl, and heteroaryl, said 
group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or C1-C3 al- 
kylene-NR 4 2f d-C 3 aIkylene-NR 4 C(0)R 8 , d-C 3 alkylene-NR 4 C(0)0R 8 , d-C 2 
alkylene-0-NR 4 2, C r C 2 alkylene-0-NR 4 C(0)R 8 , d-C 2 al- 
kylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 
10 FEP is a group selected among the group consisting of H, Ci-C 6 alkyl, 

C2-C6 alkenyl, C2-C 6 alkynyl, C 4 -C 8 alkadienyl, C3-C7 cycloalkyl, C3-C7 cyclo- 
heteroalkyl, aryl, and heteroaryl, said group being substituted with 0-3 R 4 , 0-3 
R 5 and 0-3 R 9 or d-C 3 alkylene-NR 4 2l C1-C3 aikylene-NR 4 C(0)R 8 , d-C 3 al- 
kylene-NR 4 C(0)OR 8 , d-C 2 alkylene-0-NR 4 2 , Ci-C 2 alkylene-0-NR 4 C(0)R 8 , 
15 Ci-C 2 alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 

where R 4 is H or selected independently among the group consisting 
of d-C6 alkyl, C^Cs alkenyl, C^Ce alkynyl, C3-C7 cycloalkyl, C3-C7 cyclohet- 
eroalkyl, aryl, heteroaryl, said group being substituted with 0-3 R 9 and 

R 5 is selected independently from -N 3 , -CNO, -C(NOH)NH 2 , -NHOH, 
20 -NHNHR 6 , -C(0)R 6 , -SnR 6 3 , -B(OR 6 ) 2 , -P(0)(OR 6 ) 2 or the group consisting of 
d-Ce alkenyl, d-d alkynyl, d-d alkadienyl said group being substituted 
with 0-2 R 7 , 

where R 6 is selected independently from H, d-C 6 alkyl. C3-C7 cycloal- 
kyl, aryl or d-C6 alkylene-aryl substituted with 0-5 halogen atoms selected 
25 from -F, -CI, -Br, and -I; and R 7 is independently selected from -N0 2 , 
-COOR 6 , -COR 6 , -CN, -OSiR 6 3 , -OR 6 and -NR 6 2 . 

R 8 is H, Ci-C 6 alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C3-C7 cycloalkyl, aryl or 
C1-C6 alkylene-aryl substituted with 0-3 substituents independently selected 
from -F, -CI, -N0 2 , -R 3 , -OR 3 , -SiR 3 3 
30 R 9 is =0, -F, -CI, -Br, -I, -CN, -N0 2 , -OR 6 , -NR 6 2 , -NR 6 -C(0)R 8 , 

-NR 6 -C(0)OR 8 , -SR 6 , -S(0)R 6 , -S(0) 2 R 6 , -COOR 6 , -C(0)NR 6 2 and 
-S(0) 2 NR 6 2 . 
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In a preferred embodiment Z is O or S, P is a valence bond, Q is CH, B is 
CH 2 , and R 1 , R 2 , and R 3 is H. The bond between the carbonyl group and Z is 
cleavable with aqueous l 2 . 

Contacting between target and library 

The contacting step, by which the library of bifunctional molecules is sub- 
jected under binding conditions to a target associated with a target oligonu- 
cleotide, may be referred to as the enrichment step or the selection step, as 
appropriate, and includes the screening of the library for display molecules 
having predetermined desirable characteristics. Predetermined desirable 
characteristics can include binding to a target, catalytically changing the tar- 
get, chemically reacting with a target in a manner which alters/modifies the 
target or the functional activity of the target, and covalently attaching to the 
target as in a suicide inhibitor. 

In theory, display molecules of interest can be selected based on their prop- 
erties using either physical or physiological procedures. The method pre- 
ferred according to the present invention is to enrich molecules with respect 
to binding affinity towards a target of interest. In a certain embodiment, the 
basic steps involve mixing the library of complexes with the immobilized tar- 
get of interest. The target can be attached to a column matrix or microtitre 
wells with direct immobilization or by means of antibody binding or other 
high-affinity interactions. In another embodiment, the target and displayed 
molecules interact without immobilisation of the target. Displayed molecules 
that bind to the target will be retained on this surface, while nonbinding dis- 
played molecules in a certain aspect of the invention will be removed during 
a single or a series of wash steps. The identifier oligonucleotides of com- 
plexes bound to the target can then be coupled to the target oligonucleotide. 
It may be considered advantageously to perform a chromatography step after 
or instead of the washing step, notably in cases where the target is not im- 
mobilized. After the coupling between the identifier oligonucleotide and the 
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target oligonucleotide, the coupled oligonucleotide may be recovered and 
optionally amplified before the decoding step. 

A significant reduction in background binders may be obtained with increased 
5 washing volumes, repeating washing steps, higher detergent concentrations 
and prolonged incubation during washing. Thus, the more volume and num- 
ber of steps used in the washing procedure together with more stringent con- 
ditions the more efficiently the non-binders and background binders wflf be 
removed. The right stringency in the washing step can also be used to re- 
10 move low-affinity specific binders. However, the washing step will also re- 
move wanted binders if too harsh conditions are used. 

A blocking step, such as incubation of solid phase with skimmed milk pro- 
teins or other inert proteins and/or mild detergent such as Tween-20 and Tri- 
15 ton X-100, may also be used to reduce the background. The washing condi- 
tions should be as stringent as possible to remove background binding but to 
retain specific binders that interact with the immobilized target. Generally, 
washing conditions are adjusted to maintain the desired affinity binders, e.g. 
binders in the micromolar, nanomolar, or pocomolar range. 

20 

In traditional elution protocols, false positives due to suboptimal binding and 
washing conditions are difficult to circumvent and may require elaborate ad- 
justments of experimental conditions. However, an enrichment of more than 
100 to 1000 is rarely obtained. The present invention alleviates the problem 
25 with false positive being obtained because the non-specific binding com- 
plexes to a large extent remain in solution or attached to the reaction cham- 
ber such that the indentifier oligonucleotide of non-binding complexes will be 
in a low concentration compared to the identifier oligonucleotides of binding 
complexes relative to the concentration of the target oligonucleotide. 

30 

The target can be any compound of interest. E.g. the target can be a protein, 
peptide, carbohydrate, polysaccharide, glycoprotein, hormone, receptor, an- 
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tigen, antibody, virus, substrate, metabolite, transition state analogue, cofac- 
tbr, inhibitor, drug, dye, nutrient, growth factor, cell, tissue, etc. without limita- 
tion. Suitable targets include, but are not limited to, angiotensin converting 
enzyme, renin, cyclooxygenase, 5-lipoxygenase, IIL- 1 0 converting enzyme, 
5 cytokine receptors, PDGF receptor, type II inosine monophosphate dehydro- 
genase, p-lactamases, integrin, proteases like factor Vila, kinases like Bcr- 
Abl/Her, phosphotases like PTP-1B, and fungal cytochrome P-450. Targets 
can include, but are not limited to, bradykinin, neutrophil elastase, the HIV 
proteins, including tat, rev, gag, int, RT, nucleocapsid etc., VEGF, bFGF, 
10 TGFp, KGF, PDGF, GPCR, thrombin, substance P, IgE, sPLA2, red blood 
cells, glioblastomas, fibrin clots, PBMCs, hCG, lectins, selectins, cytokines, 
ICP4, complement proteins, etc. 

Certain targets comprise one or more discrete binding domains. Proteins that 

15 contain these domains are involved in a variety of processes, such as cellular 
transporters, cholesterol movement, signal transduction and signaling func- 
tions which are. involved in development and neurotransmission, See Herz, 
Lipoprotein receptors: beacons to neurons?, (2001) Trends in Neurosciences 
24(4):193-195; Goldstein and Brown,The Cholesterol Quartet, (2001) Sci- 

20 ence 292:1310-1312. The function of a discrete binding domain is often spe- 
cific but it also contributes to the overall activity of the protein or polypeptide. 
For example, the LDL-receptor class A domain (also referred to as a class A 
module, a complement type repeat or an A-domain) is involved in ligand . 
binding while the gamma-carboxyglumatic acid (Gla) domain which is found 

25 in the vitamin-K-dependent blood coagulation proteins is involved in high- 
affinity binding to phospholipid membranes. Other discrete binding domains 
include, e.g., the epidermal growth factor (EGF)-like domain in tissue-type 
plasminogen activator which mediates binding to liver cells and thereby regu- 
lates the clearance of this fibrinolytic enzyme from the circulation and the cy- 

30 toplasmic tail of the LDL-receptor which is involved in receptormediated en- 
docytosis. 
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Individual target proteins can possess one or more discrete monomer do- 
mains, as discussed above. These proteins are often called mosaic proteins. 
For example, members of the LDL-receptor family contain four major struc- 
tural domains: the cysteine rich A-domain repeats, epidermal growth factor 
5 precursor-like repeats, a transmembrane domain and a cytoplasmic domain. 
The LDL-receptor family includes members that: 1) are cell-surface recep- 
tors; 2) recognize extracellular ligands; and 3) internalize them for degrada- 
tion by lysosomes. See Hussain et al., The Mammalian Low-Density Lipopro- 
tein Receptor Family, (1999) Annu. Rev. Nutr . 19:141-72. For example, some 

10 members include very-low-density lipoprotein receptors (VLDL-R), apolipo- 
protein E receptor 2, LDLR-related protein (LRP) and megalin. Family mem- 
bers have the following characteristics: 1) cell-surface expression; 2) ex- 
tracellular ligand binding consisting of A-domain repeats; 3) requirement of 
calcium for ligand binding; 4) recognition of receptor-associated protein and 

1 5 apolipoprotein (apo) E; 5), epidermal growth factor (EGF) precursor homol- 
ogy domain containing YWTD repeats; 6) single membrane-spanning region; 
and 7) receptor-mediated endocytosis of various ligands. See Hussain, su- 
pra. Yet, the members bind several structurally dissimilar ligands. The pre- 
sent invention offers the possibility of identifying two or more ligands against 

20 the same target as discussed elsewhere herein. 

In some aspects of the invention, a dimer compound binding with two targets 
normally interacting in a biological context is identified using the methods of 
the present invention. Examples of such targets are factor Xa and factor Vila. 

25 The method includes separately screening a library of bifunctional complexes 
and identifying suitable display molecules that binds to both targets. The two 
display molecules pools are then linked together. In the event m display 
molecules are identified having an affinity above a certain threshold (i.e. a 
low Kd) to a first target and n display molecules having an affinity above an- 

30 other or the same threshold, a dimer compound array of n times m molecules 
are formed. The dimer compound are subsequently screened for the ability to 
bind to the first and the second target molecule, thereby identifying a dimer or 
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a range of dinners that specifically bind to the first and the second target 
molecule with a certain affinity. 

A target can also be a surface of a non-biological origin, such as a polymer 
5 surface or a metal surface. The method of the invention may then be used to 
identify suitable coatings for such surfaces. 

In a preferred embodiment, the desirable display molecule acts on the target 
without any interaction between the nucleic acid attached to the desirable 
encoded molecule and the target. In one embodiment, the bound complex- 
target aggregate can be partitioned from unbound complexes prior to or sub- 
sequent to the coupling step by a number of methods. The methods include 
nitrocellulose filter binding, column chromatography, filtration, affinity chroma- 
tography, centrifugation, and other well known methods. A preferred method 
is size-exclusion chromatography. 

Briefly, the library of complexes is subjected to the target, which may include 
contact between the library and a column onto which the target is immobi- 
lised. Identifier oligonucleotides associated with undesirable display mole- 
cules, i.e. display molecules not bound to the target under the stringency 
conditions used, will pass through the column. Additional undesirable display 
molecules (e.g. display molecules which cross-react with other targets) may 
be removed by counter-selection methods. Desirable complexes are bound 
to the column. The target may be immobilized in a number of ways. In one 
embodiment, the target is immobilized through a cleavable physical link, such 
as one more chemical bonds. Following the interaction of the display mole- 
cule and the target, the respective oligonucleotides are coupled. The aggre- 
gate of the target and the complex may then be subjected to a size exclusion 
chromatography to separate the aggregate from the rest of the compounds in 
the media. 
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The complex may be provided with a cleavable linker at a position between 
the display molecule and the identifier oligonucleotide. When the target is 
immobilized the cleavable linker of the complex is preferable orthogonal to 
the cleavable linker that attached the target to the solid support. Subsequent 
5 to the optional size exclusion chromatography, this cleavable linker is 

cleaved to separate the identifier oligonucleotides of complexes having affin- 
ity towards the targets. Just to mention a single type of orthogonal cleavable 
linkages, one could attached to target to the solid support through a linkage 
that can be cleaved by a chemical agent, and the linker separating the dis- 
10 play molecule and the identifier oligonucleotide may be selected as a photo- 
cleavable linkage. More specifically, the former linkage may be a disulphide 
bond that can be cleaved by a suitable reducing agent like DTT (dithiothrei- 
tol) and the latter linkage may be an o-nitrophenyl group. 



15 There are other partitioning and screening processes which are compatible 
with this invention that are known to one of ordinary skill in the art. Such 
known process may be used in combination with the present inventive 
method. In one embodiment, the coupling products can be fractionated by a 
number of common methods and then each fraction is assayed for activity. 

20 The fractionization methods can include size, pH, hydrpphobicity, etc. 

Inherent in the present method is the selection of encoded molecules on the 
basis of a desired function; this can be extended to the selection of mole- 
cules with a desired function and specificity. Specificity can be required dur- 

25 ing the selection process by first extracting complexes which are capable of 
interacting with a non-desired "target" (negative selection, or counter- 
selection), followed by positive selection with the desired target. As an exam- 
ple, inhibitors of fungal cytochrome P-450 are known to cross-react to some 
extent with mammalian cytochrome P-450 (resulting in serious side effects). 

30 Highly specific inhibitors of the fungal cytochrome could be selected from a 
library by first removing those complexes capable of interacting with the 
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mammalian cytochrome, followed by retention of the remaining products 
which are capable of interacting with the fungal cytochrome. 

The present invention can be used to identify compounds that bind to differ- 
5 ent molecular targets. A small library of preferred compounds can be directly 
linked to an oligonucleotide that identifies the structure of the said compound. 
This can be done manually or using a more automatic system such as robotic 
equipments. These tagged compounds can then be mixed with one or more 
target molecules to select for compound and target pairs that bind to each 

10 other. For example, a library of compounds designed to bind preferably to 
protein kinases could be mixed with a library of various protein kinases to 
identify compounds that specifically interact with a certain protein kinase. The 
use of the present proximity selection procedure will in this instance generate 
an extensive structure activity relationship (SAR) where different binding 

1 5 compound are match simultaneously against different related target mole- 
cules. The above method can also be used for other target classes such as 
proteases, phosphatases, GPCRs, nuclear receptors and corresponding 
compound libraries. The information for these selections can be used to 
study the selectivity and specificity and to design sub-libraries with potential 

20 binding compounds. 

In a prior art selection where the target is immobilized to a surface, for exam- 
ple in a well or a bead, the effective concentration of the target will be high 
locally on the surface but infinitively low in the solution. This restriction will 

25 result in low recovery of binding molecules because most of the binding 

molecules are free in the solution and removed in future washing steps. The 
present invention allows selection in solution at true equilibrium conditions 
between the molecular target and the binding molecules. The invention is not 
dependent on washing- or separation-step as is most prior art selection pro- 

30 tocols. The amount of captured binding molecules can be varied using a suit- 
able concentration of the target. Most of the binding molecules with a certain 
binding constant can be captured if the target concentration is higher than the 
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binding constant of the binding molecules. Also the concentration of the tar- 
get can be adjusted to capture binders with a certain binding constant. A high 
target concentration will also increase the likelihood of selecting specific 
binders that are present in low copy number. By using high concentration of . 
> target, the solution selection can also be used to identify binders with low 
binding affinity. This is especially important with screening for small frag- 
ments which normally possess low affinity but still holds important structural 
information that can be used in second generation library design. 



10 In a certain embodiment, a binding platform may be constructed that can be 
used for almost any target. The binding platform should preferably be small 
enough to only allow association of a few or a single target molecule. This to 
ensure a solution based selection procedure with adjustable target concen- 
tration. The binding platform is primarily composed of two components; a 

15 small surface allowing association of the target molecule, and an association 
area/site for the target oligonucleotide. This binding platform may be de- 
signed to mediate the association of the target and target oligonucleotide to 
allow proximity selection in solution. 

20 Cleavable linkers 

A cleavable linker may be positioned between the target and a solid support, 
between the potential drug candidate and the identifier oligonucleotide, be- 
tween the molecular target and the target oligonucleotide or any other posi- 
tion that can provide for a separation of the identifier oligonucleotide from 

25 successful complexes from non-specific binding complexes. The cleavable 
linker may be selectively cleavable, i.e. conditions may selected that only 
cleave that particular linker. 



The cleavable linkers may be selected from a large plethora of chemical 
30 structures. Examples of linkers includes, but are not limited to, linkers having 
an enzymatic cleavage site, linkers comprising a chemical degradable com- 
ponent, and linkers cleavable by electromagnetic radiation, such as light. 
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Examples of linkers cleayable by electromagnetic radiation (light) 
o-nitrobenzyl p-alkoxy 
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o-nitrobenzyl in exo position 
N0 2 R 2 

For more details see Holmes CP. J. Org. Chem. 1997, 62, 2370-2380 

5 3-nitrophenyloxy 

R 1 — ? ^-O-P-o-p-o-R 2 

) ==/ T OH 6h 
0 2 H h . 

For more details see Rajasekharan Pillai, V. N. Synthesis. 1980, 1-26 



15 



Dansyl derivatives: 

n — ► I O 

O-S-O 




10 R 1 N 

For more details see Rajasekharan Pillai, V. N. Synthesis. 1980, 1-26 

Coumarin derivatives 
,NR 2 R 3 

hv H— NR 2 R 3 




tf-Cr^^'Cr^O H-Donor 

For more details see R. O. Schoenleber, B. Giese. Synlett 2003, 501-504 



R 1 and R 2 can be either of the potential drug candidate and the identifier oli- 
gonucleotide, respectively. Alternatively, R 1 and R 2 can be either of the target 
20 or a solid support, respectively. 



R 3 = H or OCH 3 
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If X is O then the product will be a carboxylic acid 
If X is NH the product will be a carboxamide 

One specific example is the PC Spacer Phosphoramidite (Glen research 
catalog # 10-4913-90) which can be introduced in an oligonucleotide during 
synthesis and cleaved by subjecting the sample in water to UV light (~ 300- 
350 nm) for 30 seconds to 1 minute. 



DMT = 4,4'-Dimethoxytrityl 
iPr = Isopropyl 
CNEt = Cyanoethyl 

The above PC spacer phosphoamidite is suitable incorporated in a library of 
complexes at a position between the indentifier and the potential drug candi- 
date. The spacer may be cleaved according to the following reaction. 



R 1 and R 2 can be either of the encoded molecule and the identifying mole- 
cule, respectively. In a preferred aspect R 2 is an oligonucleotide identifier and 
the R 1 is the potential drug candidate. When the linker is cleaved a phos- 
phate group is generated allowing for further biological reactions. As an ex- 
ample, the phosphate group may be positioned in the 5 'end of an oligonu- 
cleotide allowing for an enzymatic ligation process to take place. 



h 




-I 




+ 



HO-P-O-R 2 
OH 
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Examples of linkers cleavable by chemical agents: 

Ester linkers can be cleaved by nucleophilic attack using e.g. hydroxide ioris. 
In practice this can be accomplished by subjecting the target-ligand complex 
to a base for a short period. 

Rf R 4 R? R 4 

oh- o 

<A 




OH 
R 5 R 6 




10 R 1 and R 2 can be the either of be the potential drug candidate or the identifier 
oligonucleotide, respectively. R 4 ^ 6 can be any of the following: H, CN, F, N0 2 , 
S0 2 NR 2 . 

Disulfide linkers can efficiently be cleaved / reduced by Tris (2-carboxyethyl) 
15 phosphine (TCEP). TCEP selectively and completely reduces even the most 
stable water-soluble alkyl disulfides over a wide pH range. These reductions 
frequently required less than 5 minutes at room temperature. TCEP is a non- 
volatile and odorless reductant and unlike most other reducing agents, it is 
resistant to air oxidation. Trialkylphosphines such as TCEP are stable in 
20 aqueous solution, selectively reduce disulfide bonds, and are essentially un- 
reactive toward other functional groups commonly found in proteins. 



25 



R 1 -S-S-R 2 + 




R U SH + HS-R 2 + 




OH 



More details on the reduction of disulfide bonds can be found in Kirley, 
T.L.(1989), Reduction and fluorescent labeling of cyst(e)ine-containing pro- 
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teins for subsequent structural analysis, Anal. Biochem. 180, 231 and Levi- 
son, M.E., et aL (1969), Reduction of biological substances by water-soluble 
phosphines: Gamma-globulin. Experentia 25, 126-127. 

5 Linkers cleavable by enzymes 

The linker connecting the potential drug candidate with the identifier oligonu- 
cleotide or the solid support and the target can include a peptide region that 
allows a specific cleavage using a protease. This is a well-known strategy in 
molecular biology. Site-specific proteases and their cognate target amino 
10 acid sequences are often used to remove the fusion protein tags that facili- 
tate enhanced expression, solubility, secretion or purification of the fusion 
protein. 

Various proteases can be used to accomplish a specific cleavage. The speci- 
ficity is especially important when the cleavage site is presented together 
15 with other sequences such as for example the fusion proteins. Various condi- 
tions have been optimized in order to enhance the cleavage efficiency and 
control the specificity. These conditions are available and know in the art. 

Enterokinase is one example of an enzyme (serine protease) that cut a spe- 
20 cific amino acid sequence. Enterokinase recognition site is Asp-Asp-Asp- 
Asp-Lys (DDDDK), and it cleaves C-terminally of Lys. Purified recombinant 
Enterokinase is commercially available and is highly active over wide ranges 
in pH (pH 4.5-9.5) and temperature (4-45°C). 

25 The nuclear inclusion protease from tobacco etch virus (TEV) is another 

commercially available and well-characterized proteases that can be used to 
cut at a specific amino acid sequence. TEV protease cleaves the sequence 
Glu-Asn-Leu-Tyr-Phe-GIn-Gly/Ser (ENLYFQG/S) between Gln-Gly or Gln- 
Ser with high specificity. 

30 

Another well-known protease is thrombin that specifically cleaves the se- 
quence Leu-Val-Pro-Arg-Gly-Ser (LVPAGS) between Arg-Gly. Thrombin has 
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also been used for cleavage of recombinant fusion proteins. Other se- 
quences can also be used for thrombin cleavage; these sequences are more 
or less specific and more or less efficiently cleaved by thrombin. Thrombin is 
a highly active protease and various reaction conditions are known to the 
5 public. 

Activated coagulation factor FX (FXa) is also known to be a specific and use- 
ful protease. This enzyme cleaves C-terminal of Arg at the sequence He-Glu- 
Gly-Arg (IEGR). FXa is frequently used to cut between fusion proteins when 
10 producing proteins with recombinant technology. Other recognition se- 
quences can also be used for FXa. 

Other types of proteolytic enzymes can also be used that recognize specific 
amino acid sequences. In addition, proteolytic enzymes that cleave amino 
15 acid sequences in an un-specific manner can also be used if only the linker 
contains an amino acid sequence in the complex molecule. 

Other type of molecules such as ribozymes, catalytically active antibodies, or 
lipases can also be used. The only prerequisite is that the catalytically active 
20 molecule can cleave the specific structure used as the linker, or as a part of 
the linker, that connects the encoding region and the displayed molecule or, 
in the alternative the solid support and the target. 

A variety of endonucleases are available that recognize and cleave a double 
25 stranded nucleic acid having a specific sequence of nucleotides. The en- 

donuclease Eco Rl is an example of a nuclease that efficiently cuts a nucleo- 
tide sequence linker comprising the sequence GAATTC also when this se- 
quence is close to the nucleotide sequence length. Purified recombinant Eco 
Rl is commercially available and is highly active in a range of buffer condi- 
30 tions. As an example the Eco Rl is working in in various protocols as indicted 
below (NEBuffer is available from New England Biolabs): 
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NEBuffer 1 : [10 mM Bis Tris Propane-HCI, 10 mM MgCI2, 1 mM dithiothreitol 
(pH 7.0 at 25°C)], 

NEBuffer 2 : [50 mM NaCI, 10 mM Tris-HCI, 10 mM MgCI2, 1 niM dithiothrei- 
tol (pH 7.9 at 25°C)], 
5 NEBuffer 3 : [100 mM NaCI, 50 mM Tris-HCI, 10 mM MgCI2, 1 mM dithio- 
threitol (pH 7.9 at 25°C)], 

NEBuffer 4 : [50 mM potassium acetate, 20 mM Tris-acetate, 10 mM magne- 
sium acetate, 1 mM dithiothreitol (pH 7.9 at 25°C)]. 

Extension buffer : mM KCI, 20 mM Tris-HCI(Ph 8.8 at 25o C), 10 mM (NH4 )2 
10 SQ4 , 2 mM MgSO 4 and 0.1% Triton X-100, and 200 pM dNTPs. 



Determining the identifier oligonucleotide sequence 

The nucleotide sequence of the identifier sequence present in the coupled 

1 5 product is determined to identify the identity of the binding display mole- 
cule^) and7or optionally the molecular target(s). In a certain embodiment of 
the invention, chemical entities that participated in the formation of the dis- 
play molecules that binds to the target are identified. The synthesis method 
of the display molecule may be established if information on the chemical 

20 entities as well as the point in time they have been incorporated in the display 
molecule can be deduced from the identifier oligonucleotide. It may be suffi- 
cient to obtain information on the chemical structure of the various chemical 
entities that have participated in the display molecule to deduce the full mole- 
cule due to structural constraints during the formation. As an example, the 

25 use of different kinds of attachment chemistries may ensure that a chemical 
entity on a building block can only be transferred to a single position on a 
scaffold. Another kind of chemical constrains may be present due to steric 
hindrance on the scaffold molecule or the chemical entity to be transferred. In 
general however, it is preferred that information can be inferred from the 

30 identifier oligonucleotide sequence that enable the identification of each of 
the chemical entities that have participated in the formation of the encoded 
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molecule along with the point in time in the synthesis history the chemical 
entities have been incorporated in the (nascent) display molecule. 



Although conventional DNA sequencing methods are readily available and 
5 useful for this determination, the amount and quality of isolated bifunctional 
molecule may require additional manipulations prior to a sequencing reac- 
tion. 

Where the amount is low, it is preferred to increase the amount of the cou- 
10 pled oligonucleotide sequence by polymerase chain reaction (PCR) using 

PCR primers directed to primer binding sites present in the identifier oligonu- 
cleotide sequence. 



In one embodiment, the different coupled oligonucleotide sequences are 
1 5 cloned into separate sequencing vectors prior to determining their sequence 
by DNA sequencing methods. This is typically accomplished by amplifying 
the different coupled oligonucleotide sequences by PCR and then using a 
unique restriction endonuclease sites on the amplified product to directionally 
clone the amplified fragments into sequencing vectors. The cloning and se- 
20 quencing of the amplified fragments then is a routine procedure that can be 
carried out by any of a number of molecular biological methods known in the 
. art. 



Alternatively, the bifunctional complex or the PCR amplified identifier oli- 
25 gonucleotide sequence can be analysed in a microarray. The array may be 
designed to analyse the presence of a single codon or multiple codons in a 
identifier oligonucleotide sequence. 

In still another approach, the coupled oligonucleotide product is analysed by 
30 QPCR. Preferably, the QPCR affords information as to the chemical moieties 
that has participated in the formation of the display molecules and optionally 
the identity of the target. The QPCR approach also allows a direct investiga- 
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tion of the enrichment factor If two samples are analysed In parallel, one with 
target and the other with the target plus library. The difference in signal from 
these to samples will illustrate how much coupling that is due to the target 
mediated coupling compared to the background coupling. Various conditions 
5 can be investigated to obtain the most optimal selection procedure before the 
sequences are analysed to identify the precise structures of the binding 
molecules. 

Synthesis of nucleic acids 
10 Oligonucleotides can be synthesized by a variety of chemistries as is well 
known in the art. For synthesis of an oligonucleotide on a substrate in the 
direction, of 3' to 5', a free hydroxy terminus is required that can be conven- 
iently blocked and deblocked as needed. A preferred hydroxy terminus block- 
ing group is a dimexothytrityl ether (DMT). DMT blocked termini are first de- 
15 blocked, such as by treatment with 3% dichloroacetic acid in dichloro- 

methane (DCM) as is well known for oligonucleotide synthesis, to form a free 
hydroxy terminus. 

Nucleotides in precursor form for addition to a free hydroxy terminus in the 
direction of 3' to 5' require a phosphoramidate moiety having an aminodiiso- 
propyl side chain at the 3' terminus of a nucleotide. In addition, the free hy- 
droxy of the phosphoramidate is blocked with a cyanoethyl ester (OCNET), 
and the 5' terminus is blocked with a DMT ether. The addition of a 5' DMT-, 3' 
OCNET-blocked phosphoramidate nucleotide to a free hydroxyl requires 
tetrazole in acetonitrile followed by iodine oxidation and capping of unreacted 
hydroxyls with acetic anhydride, as is well known for oligonucleotide synthe- 
sis. The resulting product contains an added nucleotide residue with a DMT 
blocked 5' terminus, ready for deblocking and addition of a subsequent 
blocked nucleotide as before. 

For synthesis of an oligonucleotide in the direction of 5' to 3', a free hydroxy 
terminus on the linker is required as before. However, the blocked nucleotide 



20 



25 



30 



WO 2005/026387 PCT/DK2004/000630 

112 

to be added has the blocking chemistries reversed on its 5' and 3' termini to 
facilitate addition in the opposite orientation. A nucleotide with a free 3' hy- 
droxyl and 5' DMT ether is first blocked at the 3' hydroxy terminus by reaction 
with TBS-CI in imidazole to form a TBS ester at the 3' terminus. Then the 
5 DMT-blocked 5' terminus is deblocked with DCA in DCM as before to form a 
free 5* hydroxy terminus. The reagent (N,N-diisopropylamino)(cyanoethyl) 
phosphonamidic chloride having an aminodiisopropyl group and an OCNET 
ester is reacted in tetrahydrofuran (THF) with the 5' deblocked nucleotide to 
form the aminodiisopropyl-, OCNET-blocked phosphonamidate group on the 
10 5' terminus. Thereafter the 3' TBS ester is removed with tetrabutylammonium 
fluoride (TBAF) in DCM to form a nucleotide with the phosphonamidate- 
blocked 5' terminus and a free 3' hydroxy terminus. Reaction in base with 
DMT-CI adds a DMT ether blocking group to the 3' hydroxy terminus. 

15 The addition of the 3' DMT-, 5' OCNET-blocked phosphonamidated nucleo- 
tide to a linker substrate having a free hydroxy terminus then proceeds using 
the previous tetrazole reaction, as is well known for oligonucleotide polymeri- 
zation. The resulting product contains an added nucleotide residue with a 
DMT-blocked 3' terminus, ready for deblocking with DCA in DCM and the 

20 addition of a subsequent blocked nucleotide as before. 

Brief Description of the Figures 

Fig. 1 discloses an embodiment for proximity-dependent selection, 
Fig. 2 discloses different approaches for accomplishing coupling, 
25 Fig. 3 discloses four different approaches for producing a coupling 
Fig. 4 discloses a library versus library screening, 

Fig. 5 discloses an embodiment in which a target oligonucleotide association 
is performed in solution. 

Fig. 6 discloses a target oligonucleotide association on cell surface. 
30 Fig. 7 discloses a target with multiple binding sites which may associate with 
members of the bifunctional molecule library. 



WO 2005/026387 PCT7DK2004/000630 

1.13 

Fig. 8 discloses a target with one binding site for association with a pair of 
displayed molecules. 

Fig. 9 discloses a 2nd generation-library driven proximity selection. 
Fig. 10 discloses multiple targets for simultaneously subjected to a library of 
5 complexes. 

Detailed Disclosure of the Figures 

Fig. 1 outlines an embodiment for a proximity-dependent selection (PDS). 
The molecular target is linked to a target oligonucleotide, which in some em- 

1 0 bodiment may be unique for the target molecule. This target sequence 

comes in close proximity with a specific identifier oligonucleotide when the 
displayed molecule of a bifunctional complex.binds to the target molecule. 
This proximity will promote the coupling between the bifunctional complex 
molecules that bind to the target compare to bifunctional complex in solution. 

15 Thus, there will be a selection for coupling products that contain display 
molecules that possess affinity for the target molecule. The final ligation 
product is amplified using two primers that only amplify ligated products. 

In a first step, the target associate with a target oligonucleotide is mixed with 
20 a library of complexes, in which each complex comprises a display molecule 
attached to an identifier oligonucleotide. The display molecules are then in- 
cubated with the target. The display molecules which have an affinity towards 
. the molecular target will bind, while the complexes not having affinity will re- 
main in solution. Subsequent to the incubation, a connector oligonucleotide is 
25 added. The connector oligo nucleotide comprises parts that hybridise to se- 
quences near the ends of the target and the identifier oligonucleotides, re- 
spectively. Subsequent to the addition of the connector oligonucleotide, a 
ligation is effected by chemical or enzymatic means. Preferably a ligase is 
used to ligate the target and the identifier oligonucleotides together. The con- 
30 nector oligonucleotide is generally added in excess to saturate the com- 
plexes in solution to avoid unspecific ligation. 
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After the ligation, the ligation product is amplified by PCR. Thus, a forward 
primer is annealed to the ligation product at the 3' end thereof and extended 
using a polymerase. The transcribed product comprises a site to which a 
second (or reverse) primer can anneal so as to provide for an extension of 
5 the second primer using the transcribed product as template. Using forward 
and reverse primers as indicated above together with a polymerase and suit- 
able substrates produces amplicons, which comprises information about the 
display molecule as well as the molecular target. The figated product can be 
introduced into a host organism using a suitable vector. The host vector may 
10 be allowed to form colonies and the colonies can be sequenced to establish 
the identity of the display molecule. 

Fig. 2 shows various options to perform coupling between the target oligonu- 
cleotide and the identifier oligonucleotide. A. The ligation is promoted using a 

15 connector oligonucleotide that anneals both to the target oligonucleotide and 
the identifier oligonucleotide. The connector oligonucleotide is designed such 
that the ends of the identifier oligonucleotide and the target oligonucleotides 
are abutted. A ligase is subsequently allowed to ligate the ends together. B. 
A connector oligonucleotide is used to promote fill in of a gap using a poly- 

20 merase and finally ligation using a ligase. C. The distal end of the target oli- 
gonucleotide overlaps the distal end of the identifier oligonucleotide, which 
allows a polymerase to extend the target oligonucleotide as well as the identi- 
fier oligonucleotide thereby forming a double stranded product. D. Blunt- 
ended ligation of single-stranded or double stranded DNA using a suitable 

25 enzyme like T4 DNA ligase. 

Fig. 3 shows various methods for preparing a coupling area on an existing 
bifunctional complex. Conjugates between molecular targets associated with 
an oligonucleotide and complexes comprising a display molecule and an 
30 identifier oligonucleotide can be modified to allow a ligase to couple the oli- 
gonucleotides together. A. The identifier oligonucleotide is extended with a 
primer with an overhang that creates the coupling area. The extension is 
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suitably conducted before the selection process to obtain the benefit of a 
double stranded nucleotide sequence. A target oligonucleotide can be ligated 
to the blunt end of the extended primer or a connector oligonucleotide can be 
used to connect the target oligonucleotide and the extended primer prior to 
5 ligation with a suitable ligase. B. The identifier oligonucleotide is annealed to 
a primer that binds internally. The primer is subsequently extended, suitably 
before the selection process. The extension forms a coupling area directly on 
the identifier oligonucleotide, which allows a target oligonucleotide to be an- 
nealed and ligated. C. The first step is identical to the procedure as describe 

10 in B but the target sequence has a free 5'-end that allow ligation to the 3-end 
of the identifier oligonucleotide. A blunt ended single stranded ligation can be 
performed. Alternatively, this variation can be performed using a connector 
oligonucleotide and subsequent ligation. D. A primer is annealed to a identi- 
fier oligonucleotide and extended to produce a double-stranded DNA which is 

15 subsequently cut with an enzyme (e.g. restriction enzyme) to produce a sin- 
gle-stranded DNA portion that can be used as handle in the coupling proc- 
ess. 

Fig. 4 shows a library versus library selection method. Different targets spe- 
20 cifically encoded by the attached target oligonucleotides are mixed with a 
library of bifunctional complexes. The displayed molecules will bind to spe- 
cific targets and promote the ligation through the proximity effect. This liga- 
tion will connect the target oligonucleotides with oligonucleotides that en- 
codes for specific displayed molecules. The ligated oligonucleotides can be 
25 amplified and determined by sequencing procedures well known in the art. 

The ligated sequences will reveal which display molecules that bind to which 
target. 

Fig. 5 discloses inter alia the association of the target oligonucleotide to the 
30 target. One way of associating the target oligonucleotide with the target 

molecule is to link the oligonucleotide through a tag introduced on the target 
molecule. The tag can be attached before the target is produced (e.g. a short 
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amino acid sequence such as HIS-tag of FLAG-tag) or be modified after the 
target is produced. The target sequence can then be associated through the 
tag using a tag-binding molecule such as an antibody or other type of mole- 
cules that binds to the tag. 

5 

Fig. 6 discloses target oligonucleotide association on a cell surface. Specific 
receptors can be engineered to express a specific tag on the cell surface. 
Different tags can be used such as HIS- or FLAG-tags or other types of tags 
that become bound with the receptor. The tag will only be displayed on the 

1 0 cell surface together with the specific receptor. The target oligonucleotide is 
then associated with the receptor target using a mediator molecule that car- 
ries the target oligonucleotide and binds to the tag. A mediator molecule 
could be an antibody that binds to the tag (e.g. anti-HlS or anti-FLAG anti- 
bodies) that is associated with the target oligonucleotide. This procedure will 

15 specifically associate the target oligonucleotide with a receptor target on the 
cell surface which will promote a ligation between oligonucleotides of the 
binding displayed molecules and the target oligonucleotide. 

Fig. 7 shows a target molecule with several sites for binding of ligands. The 
20 target is subjected to a library of complexes of bifunctional molecules. Dis- 
play molecules of the complexes binds to the discrete sites of the molecular 
target thus promoting a high local concentration of the ends of the oligonu- 
cleotides which have bound to the target. Subsequently a connector oligonu- 
cleotide is added to adjoin the distal ends of the oligonucleotides together. 
25 Usually, the connector oligonucleotide is added in excess to saturate the 
ends of the identifier oligonucleotides free in the solution. The ends of the 
oligonucleotides kept together by the connector oligonucleotide are ligated 
together forming a coupled product. The coupled product is amplified by PCR 
using primers annealing to each end of the coupled product. The amplified 
30 coupled product is decoded to identify the display molecules which have 

bound to the in the target. In a step not shown on the figure, the two binding 
display molecules are coupled together via a suitable linker to form a ligand 
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which binds to two sites of the target. Suitable, the dimer comprising the two 
revealed display molecules and the linker is synthesised by organic synthe- 
. sis. 

5 Libraries of bifunctional complexes can also" be screened against each other 
using the present invention. Such an embodiment allows the detecting of 
pairs of displayed molecules that bind to the same target at different or the 
same binding site or pair of displayed molecules that bind to different targets. 
The power of the screening libraries in the above fashion is indicated by the 
10 fact that a library of e.g. 10 4 different displayed molecules generates a total 
combination of display molecules of 10 8 when pair of binders are searched 
for. 

Fig. 8 discloses a library of bifunctional complexes which is presented to a 

15 target having a site possible to be occupied by two display molecules. Ini- 
tially, the target is mixed with the library of bifunctional complexes under con- 
ditions which promote a binding interaction to take place. A first bifunctional 
complex associates with the target to form the target associated with the tar- 
get oligonucleotide. Subsequently to or simultaneously with the binding of the 

20 first display molecule, a second complex binds to the same site of the target. 
The display molecules may or may not be reacted with each other to form a 
covalent linkage between the display molecules. In another embodiment, the 
two display molecules are connected via a suitable linker or reacted with an 
external reactant so as to form a single molecule. After the binding interaction 

25 of the library of complexes with the target, the ends of the complexes which 
comprises display molecules that binds to the target are joint together. In an 
aspect of the invention, the ends are joined together using a connector 
polynucleotide. The connector polynucleotide is preferably added in excess 
to saturate ends of identifier oligonucleotides which are not part of a binding 

30 complex. After the hybridisation event between the ends of the identifier oli- 
gonucleotides and the connector oligonucleotide a ligation is conducted. 
Suitably the ligation is performed by a ligase to form a coupled product, 
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which can be used as a template by a polymerase. After the ligation, the 
coupled product is amplified by PCR to form PCR amplicons comprising in- 
formation of the display molecules which have participated in the binding in- 
teraction. 

Fig. 9 discloses a two (or more) step identification method. In a first step the 
method as disclosed in figure 1 is conducted and in the second step a new 
library prepared upon the knowledge harvested in the first library is used to 
generate the second generation library. Initially, a library of complexes are is 
presented to a target having a binding site. In the library, display molecules 
haying a binding affinity above a certain threshold is not present, illustrated 
on the drawing with a display molecule only having a partial fit in the binding 
site of the target. In the synthesis of the second generation library compo- 
nents used in the synthesis of the low binding display molecule are shuffled 
with further components and/or the low binding display molecule is added or 
subtracted a structural unit. As-an example, a further round of addition of 
chemical entities can be conducted. For systems for complex generation rely- 
ing on the natural translation system a deletion, alteration or addition of nu- 
cleic acid can be performed. The second generation library is presented to 
the a target again. In the event the alteration of the initial low binding mole- 
cule has been successful display molecules are generated which binds with a 
higher affinity towards the target. 

Fig. 10 discloses two targets which are attached to each other prior to the 
mixing with the library of complexes. The attachment can be natural, i.e. the 
association between target 1 and target 2 occur in a biological context or the 
attachment can be artificial, i.e. the association between target 1 and target 2 
is obtained by a chemical synthesis. In the latter instance, the association 
between the targets may be obtained by any chemical or enzymatic means 
which ensure a linkage. The association of the targets may also be obtained 
by expressing target 1 and target 2 as a fusion protein, i.e. a single protein 
having two distinct targets or monomer domains. In an embodiment, one of 
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the targets in the fusion protein is a capturing protein, like streptavidin. In the 
event the library is spiked with complexes having a ligand against the captur- 
ing protein, like biotin, it is feasible to form a connection between the fusion 
protein and a member of the library. The further functionality, i.e. target 2, of 
5 the fusion protein may be then be subjected to a screening process to find 
binder from the library. 

During the mixing step, the two attached targets are contacted with the library 
of complexes under binding conditions. The library may be spiked with a 

10 complex comprising a compounds known to bind to the one of the targets in 
order to find suitable binders against another target. If the library comprises 
suitable binding display molecules, two ends of the binding complexes is po- 
sitioned in close proximity. The addition of a connector oligonucleotide en- 
sures that the ends are kept close together when a ligase is allowed to per- 

15 form the action of ligating the ends together. The resulting PGR product 

comprises genetic information which encodes both the display molecules that 
have participated in the binding interaction with target 1 and target 2. 

20 

EXAMPLES illustrating the second aspect of the present invention 

Example 1 : 

Oligonucleotide sequences 
25 Target Sequence (ES-1) 

5'-X-TAGTC GATGT AGCTA GCTAG TGCGC CAATG CCTTA TCAGC 

Identifier Sequence (IS-1) (extension part) 

S'-GATCG ATGAC TGACG CCGGT AAATCTACCGTCTAAGCTG- Y- 3 ' 

30 

Underlined sequence is reverse primer binding site 
Control Identifier Sequence (CIS-1) (extension part) 
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5'-GATCG ATGAC TGACG CCGGT gracgrt cgtag afcafcc gatgc 
AAATCTACCGTCTAAGCTG-Z-3 ' 

Underlined sequence is reverse primer 

5 

Connector Sequence (CS-1 ) 

5 ' -AAAAGGAATAGTCG-CTAGCTACTGTTTT 

Primers (Forward PR-1 and reverse PR-2) 

10 

PR-l: 5'-TAGTC GATGT AGCTA GCTAG 
PR-2: 5'-CAGCT TAGAC GGTAG ATTT 

Target labelling with oligonucleotide sequence. 

15 The target molecule (streptavidin) is modified with an oligonucleotide se- 
quence using a terminus modifier that allow direct coupling to the target(s) 
molecule. The oligonucleotide sequence ES-1 is synthesised with the 5'- 
Thiol-Modifier (Glen Research, #10-1926-90) to produce a oligonucleotide 
that can be coupled to the target(s) through the 5'-end (designated X in the 

20 ES-1). 



N(iPr) 2 



NEt 



5'-Thiol-Modifier C6 



25 



30 



The covalent attachment of the oligonucleotide at the target is earned out 
with the aid of the heterobispecific crosslinker Sulfosuccinimidyl-4-(N- 
maleimidomethyl)cyclohexane-1-carboxylate (sSMMCC). The e-amino 
groups of lysine side chains of the target(s) are first derivatized with sSMCC 
cross linker to provide a maleimide functionality, which subsequently is re- 
acted with the thiolated oligonucleotide. 
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Protecol: Dissolve approx. 2 mg of sSMCC in 60 \si of DMF. Add the sSMCC 
solution to 200 jxl of a 100 jiM solution of streptavidin in PBS buffer pH 7.3 
and incubate in the dark at room' temperature for about 1 hour. The excess 

5 sSMCC is removed using NAP5 or NAP1 0 (Pharmacia) using a PBSE buffer. 
The thiolated oligo (ES-1) is activated in TE buffer, pH 7.4 using 1 mM DDT 
and excess removed using spin column (BioRad). The activated streptavidin 
and oligonucleotide are preferably used directly in the cross linking reaction 
by mixing and incubation for about 1 hour in the dark at room temperature. 

0 The modified streptavidin is preferably purified on a size-exclusion column or 
a anion-exchange column (MonoQ HR5/5, Pharmacia). 

Bifunctional complex molecules. 

Bifunctional complexes are preferably molecules that are composed of a nu- 
5 cleotide sequence that encodes for the displayed molecule. These com- 
plexes can be generated using various procedures, as disclosed elsewhere 
herein. The bifunctional complexes preferably contain an oligonucleotide that 
can be ligated or otherwise connected to the oligonucleotide sequence on the 
target mediated by the binding of the display molecule to the target. 

0 

This example describes the bifunctional complex as the IS-1 sequence which 
is labelled with a biotin in the 3*end (designated Y in the sequence). This oli- 
gonucleotide is synthesized using Biotin-dT (Glen Research, # 10-1038-95), 
which will function as the display molecule and the oligonucleotide sequence 
5 encoding the biotin moiety. The displayed biotin molecule has high affinity for 
streptavidin which will bring the coding oligonucleotide (CS-1) in close prox- 
imity of the identifier sequence (IS-1). This will promote the ligation between 
the ES-1 and the IS-1 oligonucleotides as shown below. The ligation is medi- 
ated by the connector sequence (CS-1 ). 
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A different bifunctional complex with a Fluorescein as display molecule is 
also present. The fluorescein will not bind to the target molecule (strepta- 
5 vidin) resulting in no proximity ligation between the ES-1 and the IS-1 oli- 
gonucleotides. 



Selection through proximity ligation 

10 20 pM bifunctional complex (IS-1) is mixed with 100 pM conjugated target 
molecule to allow binding of the biotin molecule to streptavidin. Another bi- 
functional complex (CIS-1) was used as a control (100 pM) with a different 
display molecule (Fluorescein) included in the synthesis of the oligonucleo- 
tide as a Fluorescein-dT (Glen Research, # 10-1056-95). This displayed 

15 molecule is encoded by another unique sequence in the CIS-1 oligonucleo- 
tide. However, the ligation region is identical to the IS-1 oligonucleotide allow- 
ing ligation if proximity is achieved. The CIS-1 oligonucleotide is 10 nucleo- 
tides longer permitting the distinction from the IS-1 oligonucleotide by running 
an agorose gel and determining the length of the oligonucleotides. The mix- 

20 tures were incubated in 50 mM KCI, 10 mM Tris-HCI. pH 8.3, 1.5 mM MgCI 2 , 
0.15 mM ATP. pH 7.4 for 1 hour to allow association of the Biotin (or Fluo- 
rescein) to streptavidin. 

The connector sequence (CS-1) is then added at 400 nM concentration to 
25 promote the connection between the ES-1 and IS-1 oligonucleotides together 
with 2 U T4 DNA ligase to start the ligation. The reaction was keep at 30°C 
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for 5 min and then 80°C for 20 min. The relatively high concentration of the 
connector oligonucleotide will saturate all IS-1 that have not been brought 
into proximity of an encoding sequence. 

5 Amplification of the ligated products was performed with the polymerase 

chain reaction (PCR) using primers corresponding to the 5'-end of the encod- 
ing sequence (PR-1 : 5-TAGTC GATGT AGCTA GCTAG) and the 3'-end of 
the identifier oligonucleotide sequence (PR-2: 5-CAGCT TAGAC GGTAG 
ATTT). The primers are design to only amplify the ligated product. PCR was 

10 performed using Ready-To-Go (RTG) PCR beads (Amersham Biosciences) 
and 10 pmol of each primer in a reaction volume of 25 pi. The PCR reaction 
consisted of an initial denaturation step of 94°C for 2 minutes followed by 20- 
45 cycles of denaturation at 94°C for 30 seconds, annealing at 58°C for 1 
minute and extension at 72°C for 1 minute. A final extension step of 2 min- 

15 utes at 72°C was included. The PCR products were resolved by agarose gel 
electrophoresis and the band corresponding to the expected size was cut 
from the gel and purified using QIAquick Gel Extraction Kit (QIAGEN). 

The different length of the Biotin and Fluorescein identifier oligonucleotide 
20 sequences (IS-1 and CIS-1) is used to verify that the bifunctional complex 
with Biotin have been enriched through the binding to streptavidin. 
This example describes a selection using two different bifunctional com- 
plexes. The same approach can be used for larger libraries, at least up to 
10 14 different molecules. The same approach is used when screening library 
25 versus library. In this case the targets are encoded by different sequences 
but with identical coupling area. 

Cloning/sequencing 

To sequence individual PCR fragments the purified PCR products were 
30 cloned into the pCR4-TOPO vector (Invitrogen) according to the manufac- 
turer's instructions. The resulting mixture was used for transformation of 
TOP10 E. coli cells (Invitrogen) using standard procedures. The cells were 
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plated on growth medium containing 100 pg/ml ampicillin and left at 37°C for 
12-16 hours. Individual E.coli clones were picked and transferred to PCR 
wells containg 50 pi water. These wells were then boiled for 5 minutes and 
20 Ml mixture from each well was used in a PCR reaction using RTG PCR 
5 beads and 5 pmol each of M1 3 forward and reverse primers according to the 
manufacturer's instructions. A sample of each PCR product was then treated 
with Exonuclease I (USB) and Shrimp Alkaline Phosphatase (USB) to re- 
move degrade single stranded DNA and dNTPs and sequenced using the 
DYEnamic ET cycle sequencing kit (Amersham Biosciences) according to 
10 the manufacturer's instructions and the reactions were analyzed on a 

MegaBace 4000 capillary sequencer (Amersham Biosciences). Sequence 
outputs were analyzed with ContigExpress software (Informax Inc.). 



Example 2: Library versus library 

15 A library of bifunctional complexes is screened against another library of en- 
coded peptides or proteins. Examples of other encoded peptides are ribo- 
some displayed peptides or mRNA displayed peptides. This example de- 
scribes the use of bifunctional complexes together with mRNA displayed tar- 
gets to perform library versus library screening. Any other library, where the 

20 genotype is associated with the phenotype, can be used together with bifunc- 
tional complexes as described in this invention. 

A freshly transcribed mRNA (0.5 - 2.5 nmol) is prepared from an appropriate 
library. The transcribed mRNA library is hybridized to biotinylated puromycin- 

25 linker (about 0.5 nmol) in 300 pi binding buffer (30 mM Tris, pH 7.0, 250 mM 
NaCI) by heating to 85° C for .30 sec followed by cooling to 4° C in 5 min. 1 00 
pi pre-washed Neutravidin beads (Pierce) is then added to the hybridization 
mixture and incubated for 30 min at 4 °C under rocking. Subsequently, the 
beads are washed in 3x 100 pi binding buffer followed by centrifugation to 

30 remove the liquid phase. The moist beads were then irradiated for 15 min at 
room temperature with a 25W UV-lamp (Pyrex-filter, A > 300nm). Subse- 
quently, the beads are washed with 100 pi plain water to yield the photo- 
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crosslinked mRNA-puromycin template which is directly used for mRNA- 
peptide fusion formation in rabbit reticulocyte lysate to produce the mRNA- 
protein fusions library according to the literature (Kurz et la., Nucleic Acids 
Res. 2000, 28:83). The puromycin-linker is also prepared according to Kurz 
5 et al. and biotinylated by carbamate bond formation between the puromycin 
amino group (50 pM) and the photo-cleavable biotin-reagent (NHS-PC-Biotin, 
5 mM, EZ-Link ™-Biotin, Pierce Chemicals) in 25% DMSO/water for 2 h at 
room temperature followed by NaCI/EtOH precipitation. 

10 The mRNA-peptide fusion library is then converted by a suitable primer to 

contain a coupling area. The coupling area is formed by a primer partly com- 
plementary to the distal region of the mRNA and which is able to promote an 
extension on the mRNA strand with the coupling area. This coupling area is 
then used in the selection procedure to couple the target oligonucleotide with 

15 the identifier oligonucleotide mediated by the binding of displayed molecules 
to the mRNA-peptide fusions. 

1-100 pmol Afunctional complex library molecules are mixed with 1-100 pmol 
mRNA-peptide fusion library in a binding buffer (50 mM KCI, 10 mM Tris-HCI. 

20 pH 8.3, 1 .5 mM MgCI 2 , 0.15 mM ATP. pH 7.4) for 1 hour to allow binding of 
the displayed molecules to the mRNA-peptide fusion molecules. The connec- 
tor sequence (an oligonucleotide the is complementary to the coupling area 
is then added at about 400 nM concentration to promote the connection be- 
tween the target and identifier oligonucleotides. Subsequently, 2 U T4 DNA 

25 ligase is added to start the ligation. The reaction is kept at 30°C for 5 min and 
then 80°C for 20 min. Amplification of the ligated products is performed using 
a polymerase chain reaction (PCR) using primers corresponding to the 5'- 
end of the target oligonucleotide and the 3-end of the identifier oligonucleo- 
tide. The primers are design to only amplify the ligated product. PCR is per- 

30 formed using Ready-To-Go (RTG) PCR beads (Amersham Biosciences) and 
10 pmol each primer in a reaction volume of 25 pi. The PCR reaction con- 
sisted of an initial denaturation step of 94°C for 2 minutes followed by 20-45 
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cycles of denaturation at 94°C for 30 seconds, annealing at 58°C for 1 min- 
ute and extension at 72°C for 1 minute. A final extension step of 2 minutes at 
72°C was included. The PCR products are resolved by agarose gel electro- 
phoresis and the band corresponding to the' expected size is cut from the gel 
5 and purified using QIAquick Gel Extraction Kit (QIAGEN). 

The invention has been described in detail with reference to preferred em- 
bodiments thereof. However, it will be appreciated that those skilled in the 
art, upon consideration of this disclosure, may make modifications and im- 
1 0 provements within the spirit and scope of the invention. 



